Method for Modulating Stem Cell Differentiation Using Stem Loop RNA

The invention relates to a method to modulate stem cell differentiation comprising
introducing stem loop containing RNA into a stem cell to ablate mRNAs which
encode polypeptides which are involved in stem cell differentiation; stem loop
RNAs; and nucleic acid molecules and vectors encoding stem loop RNAs.

A number of techniques have been developed in recent years which purport to 
specifically ablate genes and/or gene products. For example, the use of anti-sense 
nucleic acid molecules to bind to and thereby block or inactivate target mRNA 
molecules is an effective means to inhibit the production of gene products. This is 
typically very effective in plants where anti-sense technology produces a number of 
striking phenotypic characteristics. However, antisense is variable, leading to the
need to screen many, sometimes hundreds of, transgenic organisms carrying one or
more copies of an antisense transgene to ensure that the phenotype is indeed truly
linked to the antisense transgene expression. Antisense techniques, not necessarily
involving the production of stable transfectants, have been applied to cells in culture, 
with variable results. 

In addition, the ability to disrupt genes via homologous recombination has
provided biologists with a crucial tool in defining developmental pathways in higher
organisms. The use of mouse gene "knock out" strains has allowed the dissection of
gene function and the probable function of human homologues to the deleted mouse
genes (Jordan and Zant, 1998).

A much more recent technique to specifically ablate gene function is through the 
introduction of double stranded RNA, also referred to as inhibitory RNA (RNAi), 
into a cell which results in the destruction of mRNA complementary to the sequence 
included in the RNAi molecule. The RNAi molecule comprises two complementary 
strands of RNA (a sense strand and an antisense strand) annealed to each other to
form a double stranded RNA molecule. The RNAi molecule is typically derived
from exonic or coding sequence of the gene which is to be ablated.

Surprisingly, only a few molecules of RNAi are required to block gene expression,
which implies the mechanism is catalytic. The site of action appears to be nuclear as
little if any RNAi is detectable in the cytoplasm of cells, indicating that RNAi exerts
its effect during mRNA synthesis or processing.

The exact mechanism of RNAi action is unknown although there are theories to
explain this phenomenon. For example, all organisms have evolved protective
mechanisms to limit the effects of exogenous gene expression. A virus
often causes deleterious effects on the organism it infects. Viral gene expression
and/or replication therefore needs to be repressed. In addition, the rapid development
of genetic transformation and the provision of transgenic plants and animals has led
to the realisation that transgenes are also recognised as foreign nucleic acid and
subjected to phenomena variously called quelling (Singer and Selker, 1995), gene
silencing (Matzke and Matzke, 1998), and co-suppression (Stam et. al., 2000).

Initial studies using RNAi used the nematode Caenorhabditis elegans. RNAi
injected into the worm resulted in the disappearance of polypeptides corresponding to
the gene sequences comprising the RNAi molecule (Montgomery et. al., 1998; Fire et.
al., 1998). More recently the phenomenon of RNAi inhibition has been shown in a
number of eukaryotes including, by example and not by way of limitation, plants,
trypanosomes (Shi et. al., 2000) and Drosophila spp. (Kennerdell and Carthew, 2000).
Recent experiments have shown that RNAi may also function in higher eukaryotes.
For example, it has been shown that RNAi can ablate c-mos in a mouse oocyte and
also E-cadherin in a mouse preimplantation embryo (Wianny and Zernicka-Goetz,
2000).

The use of RNAi to ablate stem cell RNA is disclosed in our co-pending application,
WO 02/16620, which is incorporated by reference. 

During mammalian development those cells that form part of the embryo up until the
formation of the blastocyst are said to be totipotent (i.e. each cell has the
developmental potential to form a complete embryo and all the cells required to
support the growth and development of said embryo). During the formation of the
blastocyst, the cells that comprise the inner cell mass are said to be pluripotential
(i.e. each cell has the developmental potential to form a variety of tissues).

Embryonic stem cells (ES cells, those with pluripotentiality) may be principally 
derived from two embryonic sources. Cells isolated from the inner cell mass are 
termed embryonic stem (ES) cells. In the laboratory mouse, similar cells can be 
derived from the culture of primordial germ cells isolated from the mesenteries or 
genital ridges of days 8.5-12.5 post coitum embryos. These would ultimately 
differentiate into germ cells and are referred to as embryonic germ cells (EG cells). 
Each of these types of pluripotential cell has a similar developmental potential with
respect to differentiation into alternate cell types, but possible differences in
behaviour (eg with respect to imprinting) have led to these cells being distinguished
from one another.

Typically ES/EG cell cultures have well defined characteristics. These include, but
are not limited to:

i) maintenance in culture for at least 20 passages when maintained on fibroblast 
feeder layers; 

ii) produce clusters of cells in culture referred to as embryoid bodies; 

iii) ability to differentiate into multiple cell types in monolayer culture; 

iv) can form embryo chimeras when mixed with an embryo host; 

v) express ES/EG cell specific markers. 

Until very recently, in vitro culture of human ES/EG cells was not possible. The first
indication that conditions may be determined which could allow the establishment of
human ES/EG cells in culture is described in WO96/22362. The application
describes cell lines and growth conditions which allow the continuous proliferation 
of primate ES cells which exhibit a range of characteristics or markers which are 
associated with stem cells having pluripotent characteristics. 

More recently Thomson et al (1998) have published conditions in which human ES
cells can be established in culture. The above characteristics shown by primate ES
cells are also shown by the human ES cell lines. In addition the human cell lines
show high levels of telomerase activity, a characteristic of cells which have the
ability to divide continuously in culture in an undifferentiated state. Another group
(Reubinoff et. al., 2000) have also reported the derivation of human ES cells from
human blastocysts. Shamblott et al., 1998 have also described EG cell derivation. In
Lake et al J Cell Science 2000, 113: 555-66 and Rathjen et al J Cell Science 1999,
112: 601-12, ectodermal stem cells are disclosed. The above references are each
incorporated by reference in their entirety.

A feature of ES/EG cells is that, in the presence of fibroblast feeder layers, they 
retain the ability to divide in an undifferentiated state for several generations. If the 
feeder layers are removed then the cells differentiate. The differentiation is often to 
neurones or muscle cells but the exact mechanism by which this occurs and its
control remain unsolved. 

In addition to ES/EG cells a number of adult tissues contain cells with stem cell
characteristics. Typically these cells, although retaining the ability to differentiate
into different cell types, do not have the pluripotential characteristics of ES/EG cells.
For example haemopoietic stem cells have the potential to form all the cells of the
haemopoietic system (red blood cells, macrophages, basophils, eosinophils etc).
Nerve tissue, skin and muscle all retain pools of cells with stem cell potential.
Therefore, in addition to the use of embryonic stem cells in developmental biology,
there are also adult stem cells which may have utility with respect to determining
the factors which govern cell differentiation. Further, recent studies have suggested
that some stem cells previously thought to be committed to a single fate (e.g.
neurons) may indeed possess considerable pluripotency in certain situations. Neural
stem cells have recently been shown to chimerise a mouse embryo and form a wide
range of non-neural tissue (Clark et al., 2000).

A further group of cells which have relevance to developmental biology are 
pluripotent embryonal carcinoma cells (EC cells) which are stem cells of 
teratocarcinomas, also referred to as teratomas, which are able to differentiate into all 
cell types found in these tumours. A teratocarcinoma also includes teratocarcinoma 
cells which do not have the full pluripotential characteristics of an EC cell but 
nevertheless can differentiate into a restricted number of differentiated tissues. These 
cells have many features in common with ES/EG cells. The most important of these 
features is the characteristic of pluripotentiality. 

Teratomas contain a wide range of differentiated tissues, and have been known in 
humans for many hundreds of years. They typically occur as gonadal tumours of 
both men and women. The gonadal forms of these tumours are generally believed to 
originate from germ cells, and the extra gonadal forms, which typically have the 
same range of tissues, are thought to arise from germ cells that have migrated 
incorrectly during embryogenesis. Teratomas are therefore generally classed as germ
cell tumours, a class which encompasses a number of different types of cancer. These
include seminoma, embryonal carcinoma, yolk sac carcinoma and choriocarcinoma.

The similar biology of EC cells with ES/EG cells has been exploited to study the 
developmental fates of cells and to identify cell markers commonly expressed in EC 
cells and ES/EG cells. For example, and not by way of limitation, the expression of 
specific cell surface markers SSEA-3 (+), SSEA-4 (+), TRA-1-60 (+), TRA-1-81 (+)
(Shevinsky et al 1982; Kannagi et al 1983; Andrews et al 1984a; Thomson et al 
1995); alkaline phosphatase (+) (Andrews et. al., 1996); and Oct 4 (Scholer et al., 
1989; Kraft et. al., 1996; Reubinoff et. al., 2000; Yeom et. al., 1996).

We have accumulated expression studies which identify a number of genes thought
to be involved in determining the developmental fate of stem cells, particularly
embryonic stem cells. By northern blotting we have identified the expression of
human homologs of components of two signalling pathways believed to be critical
in cell fate determination. Expression of ligands, receptors and downstream
components of the Notch and Wingless signalling cascades has been elucidated.
Using the model system NTERA2/D1 embryonal carcinoma cells we have recorded
changes in the expression of some of these components as the cells differentiate.
Bearing in mind the role these cascades play in embryonic development throughout
the animal kingdom, these changes suggest a significant role for both the Wingless
and Notch signalling pathways in differentiation of stem cells. Furthermore the
activity of some genes is required for differentiation to occur along specific
pathways e.g. the myogenic gene MyoD1. Other genes have activity which inhibits
cellular differentiation along particular pathways. We envisage regulation of stem
cell differentiation to yield a specific cell type could be achieved by:

(i) inhibition of certain genes that normally promote differentiation along
particular pathways, thereby promoting differentiation to alternate cell
phenotypes;

(ii) inhibition of gene activity that prevents differentiation into particular cell
types; and

(iii) a combination of (i) and (ii), see Figure 1.

In our co-pending application, WO02/16620, we introduce RNAi molecules
homologous to genes encoding factors involved in stem cell differentiation. The
differentiation of stem cells during embryogenesis, during tissue renewal in the adult
and wound repair is under very stringent regulation; aberrations in this regulation
underlie the formation of birth defects during development and are thought to
underlie cancer formation in adults.

Generally, it is envisaged that stem cells are under both positive and negative
regulation which allows a fine degree of control over the process of cell proliferation
and cell differentiation: excess proliferation at the expense of cell differentiation can
lead to the formation of an expanding mass of tissue (a cancer), whereas excess
differentiation at the expense of proliferation can lead to the loss of stem cells and
production of too little differentiated tissue in the long term, and especially the loss
of regenerative potential. Certain genes have already been identified to have a
negative role in preventing stem cell differentiation. Such genes, like those of the
Notch family, when mutated to acquire activity can inhibit differentiation; such
mutant genes act as oncogenes. Conversely, loss of function of such genes, or
their inhibition, results in stem cell differentiation.

We propose to use EC cells as a model cell system to follow the effects of
perturbations in stem cell differentiation. We further propose an alternative approach
to introduce double stranded RNA molecules into stem cells to ablate mRNAs.

The invention relates to the provision of stem-loop RNA structures which can either 
be synthesised in vitro followed by transfection into a stem cell, or alternatively, 
synthesised in vivo by the stem cell from vectors which are provided with expression 
cassettes which include a DNA molecule which includes the coding sequence for the 
stem-loop RNA. 

The DNA molecule encoding the stem-loop RNA is constructed in two parts, a first
part which is derived from a gene the regulation of which is desired. The second part
is provided with a DNA sequence which is complementary to the sequence of the
first part. The cassette is typically under the control of a promoter which transcribes
the DNA into RNA. The complementary nature of the first and second parts of the
RNA molecule results in base pairing over at least part of the length of the RNA
molecule to form a double stranded hairpin RNA structure or stem-loop. The first
and second parts can be provided with a linker sequence.

According to a first aspect of the invention there is provided a method to modulate
the differentiation state of a stem cell comprising: 

(i) contacting a stem cell with at least one nucleic acid molecule comprising a
sequence of a gene which mediates at least one step in the differentiation of said cell
which nucleic acid molecule consists of a first part linked to a second part wherein
said first and second parts are complementary over at least part of their length and
further wherein said first and second parts form a double stranded region by
complementary base pairing over at least part of their length;

(ii) providing conditions conducive to the growth and differentiation of the cell
treated in (i) above; and optionally

(iii) maintaining and/or storing the cell in a differentiated state.

In a preferred method of the invention said first and second parts are linked by at
least one nucleotide base.

The provision of first and second sequences which are complementary to one another
and which comprise at least part of the coding sequence of a gene involved in stem
cell differentiation means that when the sequence is transcribed into RNA the
complementarity between first and second sequences allows base pairing between
first and second sequences to form a double stranded RNA structure, see Figure 1.
The optional provision of a linking region between first and second parts results in
the formation of a so called "hair-pin" loop structure. The transcription of the
nucleic acid provides many copies of the hair-pin loop RNA which effectively
functions as an RNAi molecule.
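The arrangement of a first part, an optional linker and a complementary second part can be illustrated with a minimal sketch. It assumes that the second part is the exact reverse complement of the first part, which is only one way of satisfying "complementary over at least part of their length". The fragment and linker shown are arbitrary placeholders, not sequences from Table 4 or the Figures, and the helper names are hypothetical.

```python
# Illustrative sketch only: placeholder sequences, hypothetical helper names.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA string (A-U and G-C pairing)."""
    return rna.translate(COMPLEMENT)[::-1]


def build_stem_loop(first_part: str, linker: str) -> str:
    """Join first part, linker and the reverse complement of the first part."""
    # Accept DNA-style input by converting T to U.
    first_part = first_part.upper().replace("T", "U")
    linker = linker.upper().replace("T", "U")
    if set(first_part + linker) - set("ACGU"):
        raise ValueError("sequence contains characters other than A, C, G, U/T")
    return first_part + linker + reverse_complement(first_part)


def stem_pairs(stem_loop: str, stem_length: int) -> list:
    """List the (5' base, 3' base) pairs expected to form the stem."""
    return [(stem_loop[i], stem_loop[-1 - i]) for i in range(stem_length)]


hairpin = build_stem_loop("GAUCCA", "UUCG")
print(hairpin)
print(stem_pairs(hairpin, 6))
```

In this sketch the first six and last six bases pair to form the double stranded stem, while the four linker bases remain unpaired as the loop.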

In a preferred method of the invention said nucleic acid molecule is a stem loop RNA 
molecule. Alternatively, said nucleic acid molecule is a DNA molecule which 
encodes said stem loop RNA. Ideally said DNA molecule is a vector adapted for
expression of said stem loop RNA.

In a preferred method of the invention said conditions are in vitro cell culture
conditions.

In a further preferred method of the invention said stem cell is selected from:
pluripotent stem cells such as embryonic stem cells, embryonic germ cells and
embryonal carcinoma cells; and lineage restricted stem cells such as, but not
restricted to: haemopoietic stem cells; muscle stem cells; nerve stem cells; skin
dermal sheath stem cells; liver stem cells; and teratocarcinoma cells.

It will be apparent that the method can provide stem cells of intermediate
commitment. For example, embryonic stem cells could be programmed to
differentiate into haemopoietic stem cells with a restricted commitment.
Alternatively, differentiated cells or stem cells of intermediate commitment could be
reprogrammed to a more pluripotential state from which other differentiated cell
lineages can be derived.

In a further preferred method of the invention said stem cell is an embryonic stem
cell or embryonic germ cell.

In a yet further preferred method of the invention said stem loop RNA molecule is
derived from a gene which encodes a cell surface receptor expressed by a stem cell. 

In a further preferred method of the invention said cell surface receptor is selected
from: human Notch 1 (hNotch 1); hNotch 2; hNotch 3; hNotch 4; TLE-1; TLE-2;
TLE-3; TLE-4; TCF7; TCF7L1; TCF7L2; TCF3; TCF19; TCF1; mFringe; lFringe;
rFringe; sel 1; Numb; Numblike; LMX; FZD1; FZD2; FZD3; FZD4; FZD5; FZD6;
FZD7; FZD8; FZD9; FZD10; FRZB.

In an alternative preferred method of the invention said stem loop RNA molecule is
derived from a gene which encodes a ligand.

Typically, a ligand is a polypeptide which binds to a cognate receptor to induce or
inhibit an intracellular or intercellular response. Ligands may be soluble or
membrane bound.

In a further alternative preferred method of the invention said ligand is selected from:
Dll-1; Dll3; Dll4; Dlk-1; Jagged 1; Jagged 2; Wnt 1; Wnt 2; Wnt 2b; Wnt 3; Wnt
3a; Wnt5a; Wnt6; Wnt7a; Wnt7b; Wnt8a; Wnt8b; Wnt10b; Wnt11; Wnt14; Wnt15.

Alternatively, said gene is selected from: SFRP1; SFRP2; SFRP4; SFRP5; SK;
DKK3; CER1; WIF-1; DVL1; DVL2; DVL3; DVL1L1; mFringe; lFringe; rFringe;
sel1l; Numb; LNX; Oct4; NeuroD1; NeuroD2; NeuroD3; Brachyury; MDFL.

In a further preferred method of the invention said stem loop RNA molecule is 
derived from at least one of the sequences identified in Table 4 or Figures 4-54. 

In a yet further preferred embodiment of the invention said sequence is derived from
Oct 4. Preferably the Oct 4 sequence corresponds to nucleotide sequence about 610
to about 1032 of the Oct 4 sequence found in GenBank accession number
NM_002701.
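As an illustration of the coordinates given above, the following minimal sketch extracts a region using 1-based, inclusive nucleotide positions. It assumes that positions 610 and 1032 are both inclusive (the text says only "about"), and it uses a made-up placeholder string in place of the NM_002701 sequence, which is not reproduced here. The helper name `extract_region` is hypothetical.

```python
def extract_region(sequence: str, start: int, end: int) -> str:
    """Return the 1-based, inclusive region start..end of a sequence."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# Placeholder only: this is NOT the real Oct 4 mRNA sequence.
placeholder_mrna = "ACGT" * 300  # 1200 nucleotides

region = extract_region(placeholder_mrna, 610, 1032)
print(len(region))  # 1032 - 610 + 1 = 423
```

Under these assumptions the first part of a stem loop derived from this region would be about 423 nucleotide bases long.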

Many methods have been developed over the last 30 years to facilitate the
introduction of nucleic acid into cells which are well known in the art and are
applicable to the stem loop RNA structures disclosed herein or the vectors which
encode said stem loop structures.

Methods to introduce nucleic acid into cells typically involve the use of chemical
reagents, cationic lipids or physical methods. Chemical methods which facilitate the
uptake of DNA by cells include the use of DEAE-dextran (Vaheri and Pagano
Science 175: p434). DEAE-dextran is a positively charged polycation which
associates with the nucleic acid and introduces it into cells. Calcium phosphate is
also a commonly used chemical agent which when co-precipitated with nucleic acid
introduces the nucleic acid into cells (Graham et al Virology (1973) 52: p456).

The use of cationic lipids (eg liposomes; Felgner (1987) Proc. Natl. Acad. Sci. USA,
84: p7413) has become a common method. The cationic head of the lipid associates
with the negatively charged nucleic acid backbone to be introduced. The lipid/nucleic 
acid complex associates with the cell membrane and fuses with the cell to introduce 
the associated nucleic acid into the cell. Liposome mediated nucleic acid transfer has 
several advantages over existing methods. For example, cells which are recalcitrant 
to traditional chemical methods are more easily transfected using liposome mediated 
transfer. 

More recently still, physical methods to introduce nucleic acid have become effective
means to reproducibly transfect cells. Direct microinjection is one such method
which can deliver nucleic acid directly to the nucleus of a cell (Capecchi (1980)
Cell, 22: p479). This allows the analysis of single cell transfectants. So called
"biolistic" methods physically shoot nucleic acid into cells and/or organelles using a
particle gun (Neumann (1982) EMBO J, 1: p841). Electroporation is arguably the
most popular method to transfect nucleic acid. The method involves the use of a
high voltage electrical charge to momentarily permeabilise cell membranes, making
them permeable to macromolecular complexes.

More recently still a method termed immunoporation has become a recognised 
technique for the introduction of nucleic acid into cells, see Bildirici et al Nature 
(2000) 405, p298. The technique involves the use of beads coated with an antibody 
to a specific receptor. The transfection mixture includes nucleic acid, antibody coated 
beads and cells expressing a specific cell surface receptor. The coated beads bind the 
cell surface receptor and when a shear force is applied to the cells the beads are 
stripped from the cell surface. During bead removal a transient hole is created 
through which nucleic acid and/or other biological molecules can enter. Transfection
efficiency of between 40% and 50% is achievable depending on the nucleic acid used.
In addition the specificity of cell delivery of RNAi molecules can be enhanced by
association or linkage of the RNAi to specific antibodies, ligands or receptors.

There are also a number of commercially available transfection kits which purport to 
provide high efficiency transfection of cells. A kit which is particularly preferred is 
sold under the tradename ExGen 500™ by MBI Fermentas, Lithuania. ExGen is a
polyethylenimine, non-liposomal transfection reagent.

According to a further aspect of the invention there is provided a stem loop RNA 
molecule derived from a coding sequence of at least one gene involved in stem cell 
differentiation comprising a first part linked to a second part wherein said first and
second parts are complementary over at least part of their length and further wherein
said first and second parts form a double stranded region by complementary base 
pairing over at least part of their length. 

In a preferred embodiment of the invention said first and second parts are linked by at 
least one nucleotide base. In a further preferred embodiment of the invention said 
first and second parts are linked by 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotide bases. In a 
yet further preferred embodiment of the invention said linker is at least 10 nucleotide 
bases. 

In a preferred embodiment said coding sequence is an exon. 

Alternatively said RNA molecule is derived from intronic sequences or the 5' and/or
3' non-coding sequences which flank coding/exon sequences of genes which mediate
stem cell differentiation. 

In a further preferred embodiment of the invention the length of the RNA molecule is
between 10 nucleotide bases (nb) and 1000nb. More preferably still the length of the
RNA molecule is selected from 10nb; 20nb; 30nb; 40nb; 50nb; 60nb; 70nb; 80nb;
90nb. More preferably still said RNA molecule is 21nb in length.

In a further preferred embodiment of the invention said RNA molecule is 100nb;
200nb; 300nb; 400nb; 500nb; 600nb; 700nb; 800nb; 900nb; or 1000nb. More
preferably still said RNA molecule is at least 1000nb.

In a further preferred embodiment of the invention said RNA molecule comprises 
sequences identified in Table 4 or Figures 4-54. 

In yet a further preferred embodiment of the invention said RNA molecules comprise 
modified nucleotide bases. 

It will be apparent to one skilled in the art that the inclusion of modified bases, as
well as the naturally occurring bases cytosine, uracil, adenine and guanine, may
confer advantageous properties on RNA molecules containing said modified bases.
For example, modified bases may increase the stability of the RNA molecule thereby 
reducing the amount required to produce a desired effect. The provision of modified 
bases may also provide stem-loop structures which are more or less stable. 

According to a further aspect of the invention there is provided a nucleic acid 
molecule encoding at least part of a gene which mediates at least one step in stem cell 
differentiation comprising a first part linked to a second part which first and second 
parts are complementary over at least part of their length, wherein said nucleic acid
molecule is operably linked to at least one further nucleic acid molecule capable of
promoting transcription of said nucleic acid linked thereto and further wherein said 
first and second parts form a double stranded region by complementary base pairing 
over at least part of their length as or when said nucleic acid molecule is transcribed. 

In a preferred embodiment of the invention said first and second parts are linked by
linking nucleotides as hereinbefore described. 

It will be apparent to one skilled in the art that the synthesis of RNA molecules which
form RNA stem loops can be achieved by providing vectors which include target
genes, or fragments of target genes, operably linked to promoter sequences.
Typically, promoter sequences are phage RNA polymerase promoters (eg T7, T3,
SP6). Advantageously vectors are provided with multiple cloning sites into which 
genes or gene fragments can be subcloned. Typically, vectors are engineered so that 
phage promoters flank multiple cloning sites containing the gene of interest. 

Alternatively target genes or fragments of target genes can be fused directly to phage
promoters by creating chimeric promoter/gene fusions via oligonucleotide synthesis
technology. Constructs thus created can be easily amplified by polymerase chain
reaction to provide templates for the manufacture of RNA molecules comprising
stem loop RNAs.
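A promoter/gene fusion of this kind can be sketched as a simple string construction. The sketch below assumes the commonly cited minimal T7 promoter sequence, with transcription taken to begin at its final G; neither detail is given in this specification. The fragment and linker are arbitrary placeholders and the function names are hypothetical.

```python
# Assumption: commonly cited minimal T7 promoter; its final G is taken as
# the first transcribed base. This sequence is not given in the specification.
T7_PROMOTER = "TAATACGACTCACTATAG"
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def dna_reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(DNA_COMPLEMENT)[::-1]


def promoter_fusion_template(fragment: str, linker: str,
                             promoter: str = T7_PROMOTER) -> str:
    """Sense-strand template: promoter, fragment, linker, inverted repeat."""
    fragment = fragment.upper()
    linker = linker.upper()
    return promoter + fragment + linker + dna_reverse_complement(fragment)


def predicted_transcript(template: str, promoter: str = T7_PROMOTER) -> str:
    """Predict the RNA read from the template under the stated assumption."""
    if not template.startswith(promoter):
        raise ValueError("template does not begin with the promoter")
    # Keep the final promoter base (the assumed start site), then T -> U.
    return template[len(promoter) - 1:].replace("T", "U")


template = promoter_fusion_template("GATCCA", "TTCG")
print(template)
print(predicted_transcript(template))
```

Under this assumption the predicted transcript carries one extra 5' G from the promoter ahead of the stem loop sequence, which does not take part in the stem.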

According to a further aspect of the invention there is provided a vector including an 
expression cassette comprising a first sequence linked to a second sequence wherein 
said first and second sequences are complementary over at least part of their lengths 
and further wherein the expression cassette is transcriptionally linked to a promoter
sequence.

In a preferred embodiment of the invention said first and second parts are linked by 
linking nucleotides as hereinbefore described. 

Vectors including expression cassettes encoding stem-loop RNAs are adapted for
eukaryotic gene expression. Typically said adaptation includes, by example and not 
by way of limitation, the provision of transcription control sequences (promoter 
sequences) which mediate cell/tissue specific expression. These promoter sequences 
may be cell/tissue specific, inducible or constitutive. 

Promoter elements typically also include so called TATA box and RNA polymerase
initiation selection sequences which function to select a site of transcription 
initiation. These sequences also bind polypeptides which function, inter alia, to 
facilitate transcription initiation selection by RNA polymerase. 

Adaptations also include the provision of selectable markers and autonomous
replication sequences which both facilitate the maintenance of said
vector in either the eukaryotic cell or prokaryotic host. Vectors which are maintained
autonomously are referred to as episomal vectors. Further adaptations which
facilitate the expression of vector encoded genes include the provision of
transcription termination sequences.

These adaptations are well known in the art. There is a significant amount of
published literature with respect to expression vector construction and recombinant
DNA techniques in general. See Sambrook et al (1989) Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY and
references therein; Marston, F (1987) DNA Cloning Techniques: A Practical
Approach Vol III, IRL Press, Oxford UK; DNA Cloning: F M Ausubel et al, Current
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994).

According to a further aspect of the invention there is provided a cell transfected with 
the nucleic acid or vector according to the invention. Preferably said cell is an
embryonic stem cell or embryonic germ cell. Alternatively said cell is an embryonal 
carcinoma cell. 

According to a further aspect of the invention there is provided a method to 
manufacture stem loop RNA molecules comprising: 

(i) providing a vector or promoter/gene fusion according to the invention; 

(ii) providing reagents and conditions which allow the synthesis of the RNA
molecule comprising a stem loop RNA molecule according to the invention; and

(iii) providing conditions which allow the RNA molecule to base pair over at least
part of its length, or at least that part corresponding to the nucleic acid sequence
encoding said stem cell gene which mediates stem cell differentiation.

Preferably said gene, or gene fragment, is selected from those genes represented in
Table 4 or Figures 4-54.

In vitro transcription of RNA is an established methodology. Kits are commercially
available which provide vectors, ribonucleoside triphosphates, buffers, RNase
inhibitors and RNA polymerases (eg phage T7, T3, SP6) which facilitate the
production of RNA.

According to a further aspect of the invention there is provided an in vivo method to 
promote the differentiation of stem cells comprising administering to an animal an 
effective amount of stem loop RNA molecule, or vector encoding a stem loop RNA 
molecule according to the invention, sufficient to effect differentiation of a target 
stem cell.

Preferably said method promotes differentiation in vivo of endogenous stem cells to 
repair tissue damage in situ. 

It will be apparent to one skilled in the art that stem loop RNA relies on homology
between the target gene RNA and double stranded region of the stem loop in a
similar way to conventional RNAi. This confers a significant degree of specificity to
the stem loop RNA molecule in targeting stem cells. For example, haemopoietic
stem cells are found in bone marrow and stem loop RNA molecules may be
administered to an animal by direct injection into bone marrow tissue.

Stem loop RNA molecules may be encapsulated in liposomes to provide protection
from an animal's immune system and/or nucleases present in an animal's serum.

Liposomes are lipid based vesicles which encapsulate a selected therapeutic agent 
which is then introduced into a patient. Typically, the liposome is manufactured 
either from pure phospholipid or a mixture of phospholipid and phosphoglyceride. 
Typically liposomes can be manufactured with diameters of less than 200 nm; this 
enables them to be intravenously injected and to pass through the pulmonary 
capillary bed. Furthermore the biochemical nature of liposomes confers 
permeability across blood vessel membranes to gain access to selected tissues. 
Liposomes do have a relatively short half-life. So called STEALTH® liposomes have 
been developed which comprise liposomes coated in polyethylene glycol (PEG). The 
PEG treated liposomes have a significantly increased half-life when administered 
intravenously to a patient. In addition STEALTH® liposomes show reduced uptake 
in the reticuloendothelial system and enhanced accumulation in selected tissues. In 
addition, so called immuno-liposomes have been developed which combine lipid based 
vesicles with an antibody or antibodies, to increase the specificity of the delivery of 
the RNAi molecule to a selected cell/tissue. 

The use of liposomes as delivery means is described in US5580575 and US5542935. 

It will be apparent to one skilled in the art that the stem loop RNA molecules can be 
provided in the form of an oral or nasal spray, an aerosol, suspension, emulsion, 
and/or eye drop fluid. Alternatively the stem loop RNA molecules may be provided 
in tablet form. Alternative delivery means include inhalers or nebulisers. 


According to a yet further aspect of the invention there is provided a therapeutic 
composition comprising a stem loop RNA molecule according to the invention or a 
vector encoding a stem loop RNA according to the invention. 

Preferably said stem loop RNA molecule or vector is for use in the manufacture of a 
medicament for use in promoting the differentiation of stem cells to provide 
differentiated cells/tissues to treat diseases where cells/tissues are destroyed by said 
disease. 

Typically this includes pernicious anemia; stroke; neurodegenerative diseases such as 
Parkinson's disease and Alzheimer's disease; coronary heart disease; cirrhosis; and 
diabetes. It will also be apparent that differentiated stem cells may be used to replace 
nerves damaged as a consequence of trauma (e.g. replacement of spinal cord tissue). 

In a further preferred embodiment of the invention said therapeutic composition 
further comprises a diluent, carrier or excipient. 

According to a further aspect of the invention there is provided a cell obtainable by 
the method according to the invention. 

It will be apparent that a cell obtainable by the method according to the invention has 
useful applications. For example, a stably transfected cell under the control of a 
regulatable promoter (i.e. inducible, repressible, developmentally regulated, cell 
lineage regulated, cell-cycle regulated) offers the opportunity to modulate the 
expression of the stem-loop RNA in said cell thereby modulating the differentiation 
state, or not as the case may be, in culture or in vivo. 

According to a yet further aspect of the invention there is provided at least one organ 
comprising at least one cell obtainable by the method according to the invention. 

According to a yet further aspect of the invention there is provided a non-human 
transgenic animal comprising an RNA molecule according to the invention, or a 
nucleic acid molecule according to the invention, or a vector according to the 
invention. 

An embodiment of the invention will now be described by example only and with 
reference to the following figures and tables wherein: 

Table 1 represents a selection of antibodies used to monitor stem cell differentiation; 

Table 2 represents nucleic acid probes used to assess mRNA markers of stem cell 
differentiation; 

Table 3 represents protein markers of stem cell differentiation; 

Table 4 represents specific primers used to generate stem loop RNA for gene 
specific inhibition; 

Table 5 represents vectors used for the expression of stem loop RNA in cells 
including the promoters used to drive transcription of stem loop RNA's. 

Figure 1 illustrates that stem cell differentiation is controlled by positive and negative 
regulators (A). The specific cell phenotypes that are derived are a direct result of 
positive and negative regulators which activate or suppress particular differentiation 
events. Stem loop RNA can be used to control both the initial differentiation of stem 
cells (A) and the ultimate fate of the differentiated cells D1 and D2 by repression of 
positive activators which would normally promote a particular cell fate; 

Figure 2 represents the Oct 4 nucleic acid sequence from position 610-1032 of the 
sequence found in GenBank accession number NM_002701; 

Fig 3A illustrates a transcription cassette comprising a promoter sequence operably 
linked to a nucleic acid encoding a stem loop RNA; Fig 3B illustrates a stem loop 
RNA synthesised from the cassette illustrated in Fig 3A; 

Figure 4 is the nucleic acid sequence of murine notch ligand delta-like 1; 
Figure 5 is the nucleic acid sequence of murine notch ligand jagged 1; 
Figure 6 is the nucleic acid sequence of human notch ligand jagged 1 (alagille 
syndrome) (JAG1); 
Figure 7 is the nucleic acid sequence of human notch ligand jagged 2 (JAG2); 
Figure 8 is the nucleic acid sequence of murine notch ligand jagged 2; 
Figure 9 is the nucleic acid sequence of human notch ligand delta-like 3 (DLL3); 
Figure 10 is the nucleic acid sequence of human notch ligand delta-1 (DLL1); 
Figure 11 is the nucleic acid sequence of human notch ligand delta-like 4 (DLL4); 
Figure 12 is the nucleic acid sequence of murine notch ligand delta-like 4 (DLL4); 
Figure 13 represents the nucleic acid sequence of human Wnt 13; 
Figure 14 represents the nucleic acid sequence of human dickkopf 1; 
Figure 15 represents the nucleic acid sequence of human dickkopf 2; 
Figure 16 represents the nucleic acid sequence of human dickkopf 3; 
Figure 17 represents the nucleic acid sequence of human dickkopf 4; 
Figure 18 represents the nucleic acid sequence of WNT-1; 
Figure 19 represents the nucleic acid sequence of WNT-2; 
Figure 20 represents the nucleic acid sequence of WNT 2B; 
Figure 21 represents the nucleic acid sequence of WNT 3; 
Figure 22 represents the nucleic acid sequence of WNT 4; 
Figure 23 represents the nucleic acid sequence of WNT 5A; 
Figure 24 represents the nucleic acid sequence of WNT 6; 
Figure 25 represents the nucleic acid sequence of WNT 7A; 
Figure 26 represents the nucleic acid sequence of WNT 8B; 
Figure 27 represents the nucleic acid sequence of WNT 10B; 
Figure 28 represents the nucleic acid sequence of WNT 11; 
Figure 29 represents the nucleic acid sequence of WNT 14; 
Figure 30 represents the nucleic acid sequence of WNT 16; 
Figure 31 represents the nucleic acid sequence of FZD 1; 
Figure 32 represents the nucleic acid sequence of FZD 2; 
Figure 33 represents the nucleic acid sequence of FZD 3; 
Figure 34 represents the nucleic acid sequence of FZD 4; 
Figure 35 represents the nucleic acid sequence of FZD 5; 
Figure 36 represents the nucleic acid sequence of FZD 6; 
Figure 37 represents the nucleic acid sequence of FZD 7; 
Figure 38 represents the nucleic acid sequence of FZD 8; 
Figure 39 represents the nucleic acid sequence of FZD 9; 
Figure 40 represents the nucleic acid sequence of FZD 10; 
Figure 41 represents the nucleic acid sequence of FRP; 
Figure 42 represents the nucleic acid sequence of SARP 1; 
Figure 43 represents the nucleic acid sequence of SARP 2; 
Figure 44 represents the nucleic acid sequence of FRZB; 
Figure 45 represents the nucleic acid sequence of FRPHE; 
Figure 46 represents the nucleic acid sequence of SARP 3; 
Figure 47 represents the nucleic acid sequence of CER 1; 
Figure 48 represents the nucleic acid sequence of DKK 1; 
Figure 49 represents the nucleic acid sequence of DKK 2; 
Figure 50 represents the nucleic acid sequence of DKK 3; 
Figure 51 represents the nucleic acid sequence of DKK 4; 
Figure 52 represents the nucleic acid sequence of WIF-1; 
Figure 53 represents the nucleic acid sequence of SFRP 1; and 
Figure 54 represents the nucleic acid sequence of SFRP 4. 



Materials and Methods 

Cell Culture 

NTERA2 and 2102Ep human EC cell lines were maintained at high cell density as 
previously described (Andrews et al 1982, 1984b), in DMEM (high glucose 
formulation) (GIBCO BRL), supplemented with 10% v/v foetal calf 
serum (GIBCO BRL), under a humidified atmosphere with 10% CO2 in air. 

Stem Loop RNA Production 

Primers were designed against specific target genes with T7 bacteriophage promoters 
at their 5' ends. The primers typically consist of 18-25 bp against the target gene, a 
linker sequence of variable length (indicated by N in the primer sequence), followed by 
the reverse complement of the gene specific sequence. The primers were used in a 
standard RNA in vitro transcription reaction using a MEGASCRIPT kit following the 
manufacturer's protocols (Ambion, USA). Longer slRNA templates were produced 
by cloning head-to-tail the sense and anti-sense gene specific sequences to generate 
a palindromic template from which RNA could be synthesized. 

The following primers were used: 

Gene         Accession Number   Primer Sequence 
Oct4         Z11899             TAATACGACTCACTATAGGgagcagcttgggctcgagaag(N)cttctcgagcccaagctgctc 
HsNotch2     -                  TAATACGACTCACTATAGGtcgtgcaagagccagttaccc(N)gggtaactggctcttgcacga 
HsNotch1     M73980             TAATACGACTCACTATAGGaatggtcaatgcgagtggctgtcc(N)ggacagccactcgcgttgaccatt 
CIF          -                  TAATACGACTCACTATAGGagtagtgagagtgagagtaaca(N)tgttactctcactctcactact 
RBPJ-kappa   -                  TAATACGACTCACTATAGGtcctgtgcctgtggtagaga(N)tctctaccacaggcacagga 
Dlk1         NM_002226          TAATACGACTCACTATAGGcctcttgctcctgctggcttt(N)aaagccagcaggagcaagagg 


Capital letters indicate the T7 polymerase promoter sequence. 
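
The primer layout described above (T7 promoter, gene specific sequence, linker, reverse complement of the gene specific sequence) can be sketched in a few lines of Python. This is only an illustrative sketch: the function names are hypothetical, and the four-base linker is a placeholder chosen for the example, since the linker length is left variable in the primer table. The Oct4 gene specific sequence is taken from the table above.

```python
# T7 promoter as printed in capitals in the primer table.
T7_PROMOTER = "TAATACGACTCACTATAGG"

# Translation table for complementing DNA bases, preserving case.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def stem_loop_template(gene_specific: str, linker: str) -> str:
    """Assemble promoter + sense arm + linker + antisense arm, written 5' to 3'."""
    return T7_PROMOTER + gene_specific + linker + reverse_complement(gene_specific)


# Gene specific arm of the Oct4 primer listed in the table.
oct4_arm = "gagcagcttgggctcgagaag"

# Hypothetical 4-base placeholder linker (assumption; not specified in the source).
example = stem_loop_template(oct4_arm, "nnnn")
print(example)
```

Applied to the Oct4 arm, `reverse_complement` reproduces the antisense arm printed in the table, which is a convenient check when designing primers for further target genes.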

In each case, a quantity of the PCR was electrophoresed through agarose to verify 
product size and abundance, whilst the remainder was purified by alkaline 
phenol/chloroform extraction. RNA was synthesized using the Megascript kit 
(Ambion Inc.) according to the manufacturer's protocol and acid phenol/chloroform 
extracted. The simultaneous synthesis of complementary strands of RNA in a single 
reaction circumvents the requirement for an annealing step. However, the quality and 
duplexing of the synthesized RNA was confirmed by agarose gel electrophoresis, 
with the desired products migrating as expected for double stranded DNA of the 
same length. 

Stem Loop RNA introduction to Cell Lines 

Human EC stem cells were seeded at 2 × 10⁵ cells/well of a 6 well plate in 3 cm³ of 
Dulbecco's modified Eagle's medium and allowed to settle for 3 hrs. 
Approx. 9.5 µg of DNA was incubated with an optimised amount of ExGen 500 for 
each well of a 6-well plate. Cells had previously been seeded 1 day before, which gives 
an approx. 70% confluent culture. The DNA/ExGen mixture was added to the cells and 
the culture vessel spun at 280 g for 5 mins. 

Total RNA production 






WO 03/012082 



PCT/GB02/03409 



Growing cultures of cells were aspirated to remove the DMEM and foetal calf serum. 
Trace amounts of foetal calf serum were removed by washing in phosphate-buffered 
saline (PBS). Fresh PBS was added to the cells and the cells were dislodged from the 
culture vessel using acid washed glass beads. The resulting cell suspension was 
centrifuged at 300 x g. The PBS was aspirated from the pellets. Tri reagent 
(Sigma, USA) was added at 1 ml per 10⁷ cells and allowed to stand for 10 mins at 
room temperature. The lysate from this reaction was centrifuged at 12000 x g for 15 
minutes at 4°C. The resulting aqueous phase was transferred to a fresh vessel and 
0.5 ml of isopropanol per ml of Tri reagent was added to precipitate the RNA. The RNA was 
pelleted by centrifugation at 12000 x g for 10 mins at 4°C. The supernatant was 
removed and the pellet washed in 70% ethanol. The washed RNA was dissolved in 
DEPC treated double-distilled water. 
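
The reagent ratios given above (1 ml of Tri reagent per 10⁷ cells, then 0.5 ml of isopropanol per ml of lysis reagent) scale linearly with cell number. The following Python sketch, with a hypothetical function name, simply applies those two ratios; it is an illustration of the arithmetic and not part of the protocol itself.

```python
def tri_reagent_volumes(cell_count: float) -> dict:
    """Return reagent volumes (ml) for a given number of cells.

    Ratios taken from the text: 1 ml lysis reagent per 1e7 cells,
    0.5 ml isopropanol per ml of lysis reagent.
    """
    if cell_count < 0:
        raise ValueError("cell_count must be non-negative")
    tri_ml = cell_count / 1e7 * 1.0
    isopropanol_ml = 0.5 * tri_ml
    return {"tri_reagent_ml": tri_ml, "isopropanol_ml": isopropanol_ml}


# Example: a pellet of 2 x 10^7 cells.
volumes = tri_reagent_volumes(2e7)
print(volumes)
```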

Analysis of the differentiation of EC stem cells induced by exposure to Stem Loop 
RNA 

Following exposure to stem loop RNA corresponding to specific key regulatory 
genes, the subsequent differentiation of the EC cells was monitored in a variety of 
ways. One approach was to monitor the disappearance of typical markers of the stem 
cell phenotype; the other was to monitor the appearance of markers pertinent to the 
specific lineages induced. The relevant markers included surface antigens, mRNA 
species and specific proteins. 

Analysis of Transfectants by Antibody Staining and FACS 

Cells were treated with trypsin (0.25% v/v) for 5 mins to disaggregate the cells; they 
were washed and re-suspended to 2 × 10⁵ cells/ml. This cell suspension was incubated 
with 50 µl of primary antibody in a 96 well plate on a rotary shaker for 1 hour at 4°C. 
Supernatant from a myeloma cell line, P3X63Ag8, was used as a negative control. 
The 96 well plate was centrifuged at 100 rpm for 3 minutes. The plate was washed 3 
times with PBS containing 5% foetal calf serum to remove unbound antibody. Cells 
were then incubated with 50 µl of an appropriate FITC-conjugated secondary 
antibody at 4°C for 1 hour. Cells were washed 3 times in PBS + 5% foetal calf 
serum and analysed using an EPICS Elite ESP flow cytometer (Coulter Electronics, 
U.K.) (Andrews et al., 1982). 

Northern blot Analysis of RNA 

RNA separation relies on generally the same principles as standard DNA separation but with 
some concessions to the tendency of RNA to hybridise with itself or other RNA 
molecules. Formaldehyde is used in the gel matrix to react with the amine groups of 
the RNA and form Schiff bases. Purified RNA is run out using standard agarose gel 
electrophoresis. For most RNA a 1% agarose gel is sufficient. The agarose is made in 
1X MOPS buffer and supplemented with 0.66 M formaldehyde. Dried down RNA 
samples are reconstituted and denatured in RNA loading buffer and loaded into the 
gel. Gels are run out for approx. 3 hrs (until the dye front is 3/4 of the way down the 
gel). 

The major problem with obtaining clean blotting using RNA is the presence of 
formaldehyde. The run out gel was soaked in distilled water for 20 mins with 4 
changes, to remove the formaldehyde from the matrix. The transfer assembly was 
assembled in exactly the same fashion as for DNA (Southern) blotting. The transfer 
buffer used, however, was 10X SSPE. Gels were transferred overnight. The membrane 
was soaked in 2X SSPE to remove any agarose from the transfer assembly and the 
RNA was fixed to the membrane. Fixation was achieved using short-wave (254 nm) 
UV light. The fixed membrane was baked for 1-2 hrs to drive off any residual 
formaldehyde. 

Hybridisation was achieved in aqueous phase with formamide to lower the 
hybridisation temperatures for a given probe. RNA blots were prehybridised for 2-4 
hrs in northern prehybridisation solution. Labelled DNA probes were denatured at 
95°C for 5 mins and added to the blots. All hybridisation steps were carried out in 
rolling bottles in incubation ovens. Probes were hybridised overnight for at least 16 
hrs in the prehybridisation solution. A standard set of wash solutions was used. 
Stringency of washing was achieved by the use of lower salt containing wash buffers. 
The wash procedure was as follows: 

2X SSPE              15 mins   room temp 
2X SSPE              15 mins   room temp 
2X SSPE / 0.1% SDS   45 mins   65°C 
2X SSPE / 0.1% SDS   45 mins   65°C 
0.1X SSPE            15 mins   room temp 
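
To illustrate what "lower salt containing wash buffers" means in numbers, the wash series above can be written down as data and the sodium chloride concentration of each step estimated. This sketch assumes the commonly used 20X SSPE stock formulation containing 3.0 M NaCl; that figure is an assumption and is not stated in this document. All names are hypothetical.

```python
# Assumed NaCl molarity of a 20X SSPE stock (common formulation; not from the source).
SSPE_20X_NACL_M = 3.0

# Wash series as listed in the text: (SSPE strength in X, minutes, temperature).
WASHES = [
    (2.0, 15, "room temp"),
    (2.0, 15, "room temp"),
    (2.0, 45, "65°C"),
    (2.0, 45, "65°C"),
    (0.1, 15, "room temp"),
]


def nacl_molarity(sspe_x: float) -> float:
    """Approximate NaCl molarity of an SSPE dilution of the given strength."""
    return SSPE_20X_NACL_M * sspe_x / 20.0


# Total hands-on wash time in minutes.
total_minutes = sum(minutes for _, minutes, _ in WASHES)

for strength, minutes, temperature in WASHES:
    print(f"{strength}X SSPE: ~{nacl_molarity(strength):.3f} M NaCl, "
          f"{minutes} min, {temperature}")
print("total wash time (min):", total_minutes)
```

Under that assumption the final 0.1X wash contains roughly one twentieth of the salt of the 2X washes, which is what makes it the most stringent step.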

Preparation of radiolabeled DNA probes 

The method of Feinberg and Vogelstein (Feinberg and Vogelstein, 1983) was used to 
radioactively label DNA. Briefly, the protocol uses random sequence hexanucleotides 
to prime DNA synthesis at numerous sites on a denatured DNA template using the 
Klenow fragment of DNA polymerase I. Pre-formed kits were used to aid consistency. 
5-100 ng of DNA fragment (obtained from gel purification of PCR products or restriction 
digests) was made up in water and denatured for 5 mins at 95°C with the random 
hexamers. The mixture was quench cooled on ice and the following were added: 

5 µl [α-32P] dATP (3000 Ci/mmol) 
1 µl of Klenow DNA polymerase (4 U) 

The reaction was then incubated at 37°C for 1 hr. Unincorporated nucleotides were 
removed with spin columns (Nucleon Biosciences). 



Production of cDNA 

The enzymatic conversion of RNA into single stranded cDNA was achieved using 
the 3' to 5' polymerase activity of recombinant Moloney-Murine Leukemia Virus 
(M-MLV) reverse transcriptase primed with oligo (dT) and (dN) primers. For 
Reverse Transcription-Polymerase Chain Reaction, single stranded cDNA was used. 
cDNA was synthesised from 1 µg poly (A)+ RNA or total RNA, which was incubated with 
the following: 

1.0 µM oligo(dT) primer for total RNA or random hexamers for mRNA 
0.5 mM dNTP mix (10 mM stock) 
1 U/µl RNase inhibitor (Promega) 
1.0 U/µl M-MLV reverse transcriptase in manufacturer's supplied buffer (Promega) 

The reaction was incubated for 2-3 hours at 42°C. 
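
The component list above gives final concentrations, so the volume of each stock to pipette follows from the usual dilution relation C1 x V1 = C2 x V2. The Python sketch below is illustrative only. It reads the dNTP entry as a 10 mM stock used at 0.5 mM final, and it assumes a 20 µl reaction volume and a 50 µM oligo(dT) stock, neither of which is given in this document. The function name is hypothetical.

```python
def stock_volume_ul(final_conc: float, stock_conc: float,
                    reaction_volume_ul: float) -> float:
    """Volume of stock (µl) giving final_conc in the reaction (C1*V1 = C2*V2).

    final_conc and stock_conc must be in the same units.
    """
    if stock_conc <= 0:
        raise ValueError("stock concentration must be positive")
    if final_conc > stock_conc:
        raise ValueError("final concentration cannot exceed the stock")
    return final_conc * reaction_volume_ul / stock_conc


# Assumed reaction volume (not stated in the source).
REACTION_VOLUME_UL = 20.0

# dNTP mix: 10 mM stock, 0.5 mM final (as read from the component list).
dntp_ul = stock_volume_ul(0.5, 10.0, REACTION_VOLUME_UL)

# oligo(dT): 1.0 µM final from a hypothetical 50 µM stock.
oligo_dt_ul = stock_volume_ul(1.0, 50.0, REACTION_VOLUME_UL)

print(f"dNTP mix: {dntp_ul:.2f} µl, oligo(dT): {oligo_dt_ul:.2f} µl")
```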
Fluorescent Automated Sequencing 

To check the specificity of the PCR primers used to generate the template used in 
stem loop RNA production, automatic sequencing was carried out using the Prism 
fluorescently labelled chain terminator sequencing kit (Perkin-Elmer) (Prober et al 
1987). A suitable amount of template (200 ng plasmid, 100 ng PCR product) and 10 µM 
sequencing primer (typically a 20mer with 50% G-C content) were added to 8 µl of 
Prism pre-mix and the total reaction volume made up to 20 µl. 24 cycles of PCR were 
performed (94°C for 10 seconds, 50°C for 10 seconds, 60°C for 4 minutes). Following 
thermal cycling, products were precipitated by the addition of 2 µl of 3 M sodium acetate 
and 50 µl of 100% ethanol. DNA was pelleted in an Eppendorf microcentrifuge at 13000 
rpm, washed once in 70% ethanol and vacuum dried. Samples were analysed by the 
in-house sequencing service (Krebs Institute). Dried down samples were 
resuspended in 4 µl of formamide loading buffer, denatured and loaded onto an ABI 
373 automatic sequencer. Raw sequence was collected and analysed using the ABI 
Prism software and the results were supplied in the form of analysed histogram 
traces. 

Detection of specific protein targets by SDS-PAGE and Western Blotting 

To obtain cell lysates, monolayers of cells were rinsed 3 times with ice-cold PBS 
supplemented with 2 mM CaCl2. Cells were incubated with 1 ml of lysis buffer per 
75 cm² flask (1% v/v NP40, 1% v/v DOC, 0.1 mM PMSF in PBS) for 15 min at 4°C. Cell 
lysates were transferred to Eppendorf tubes and passed through a 21 gauge needle to 
shear the DNA. This was followed by freeze thawing and subsequent centrifugation 
(30 min, 4°C, 15000 g) to remove insoluble material. Protein concentrations of the 
supernatants were determined using a commercial protein assay (Biorad). Samples 
were prepared for SDS-PAGE by adding 6 times Laemmli electrophoresis sample 
buffer and boiling for 5 min. After electrophoresis with 16 µg of protein on a 10% 
polyacrylamide gel (Laemmli, 1970) the proteins were transferred to PVDF 
membrane. The blots were washed with PBS and 0.05% Tween (PBS-T). Blocking 
of the blots occurred in 5% milk powder in PBS-T (60 min, at RT). Blots were 
incubated with the appropriate primary antibody. Horseradish peroxidase labelled 
secondary antibody was used to visualise antibody binding by ECL (Amersham, 
Bucks., UK). Materials used for SDS-PAGE and western blotting were obtained from 
Biorad (California, USA) unless stated otherwise. 
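
Loading a fixed 16 µg of protein per lane means the lysate volume depends on the measured protein concentration, and the volume of 6 times sample buffer follows from it. The Python sketch below shows that arithmetic. It assumes the 6 times buffer is diluted to 1 times in the final sample, which the text implies but does not state; the function name and the example concentration are hypothetical.

```python
def loading_volumes(protein_conc_ug_per_ul: float,
                    protein_ug: float = 16.0,
                    buffer_fold: float = 6.0) -> tuple:
    """Return (lysate_ul, sample_buffer_ul) for one gel lane.

    The buffer volume is chosen so that a buffer_fold-times concentrate
    ends up at 1x in the final mixture (assumption).
    """
    if protein_conc_ug_per_ul <= 0:
        raise ValueError("protein concentration must be positive")
    lysate_ul = protein_ug / protein_conc_ug_per_ul
    buffer_ul = lysate_ul / (buffer_fold - 1.0)
    return lysate_ul, buffer_ul


# Example: a lysate measured at 2 µg/µl (hypothetical value).
lysate_ul, buffer_ul = loading_volumes(2.0)
print(f"lysate: {lysate_ul:.1f} µl, 6x sample buffer: {buffer_ul:.1f} µl")
```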

Table 1: Antibodies used to detect stem cell differentiation 

Antibody    Class   Species   Cell phenotype detected   Changes on differentiation   Reference 
TRA-1-60    IgM     Mouse     Human EC, ES cells        ↓ differentiation            Andrews et al., 1984a 
TRA-1-81    IgM     Mouse     Human EC, ES cells        ↓ differentiation            Andrews et al., 1984a 
SSEA3       IgM     Rat       Human EC, ES cells        differentiation              Shevinsky et al., 1982; Fenderson et al., 1987 
SSEA4       IgG     Mouse     Human EC, ES cells        differentiation              Kannagi et al., 1983; Fenderson et al., 1987 
A2B5        IgM     Mouse     -                         ↑ differentiation            Fenderson et al., 1987 
ME311       IgG     Mouse     -                         ↑ differentiation            Fenderson et al., 1987 
VIN-IS-56   IgM     Mouse     -                         ↑ differentiation            Andrews et al., 1990 
VIN-IS-53   IgG     Mouse     -                         ↑ differentiation            Andrews et al., 1990 


Table 2: Probes used to assess mRNA markers of differentiation 

Gene                  Cell Type 
Synaptophysin         Neuron 
NeuroD1               Neuron 
MyoD1                 Muscle 
Collagens             Cartilage 
Alpha-actin           Skeletal muscle 
Smooth-muscle actin   Smooth muscle 


Table 3: Protein markers of differentiation, detected by Western Blot and/or 
immunofluorescence. 

The following antigens were detected by the appropriate commercially 
available antibodies. 

Cell Type                 Antigen 
Neurons                   Neurofilaments 
Glial cells               GFAP 
Epithelial cells          Cytokeratins 
Mesenchymal cells         Vimentin 
Muscle                    Desmin 
Muscle                    Tissue specific actins 
Connective tissue cells   Collagens 

Table 4: Specific Primers used to generate Stem Loop RNA for gene specific 
inhibition 

All sequences written 5' to 3' 

Gene Name       Accession number    PCR primer sequences          Position 

Notch Pathway 

Ligands: 
Dll-1           AF003522            -                             - 
Dll3            NM_016941           -                             - 
Dll4            NM_019454           -                             - 
Dlk-1           NM_003836           -                             - 
Jagged1         U73936              -                             - 
Jagged2         NM_002226           -                             - 

Receptors: 
Notch1          M73980              gcggccgcctttgtggttctgttc      [illegible] 
                                    gccggcgcgtcctcctcttcc 
Notch2          In-house sequence   gccagaatgatgctacctgt          [illegible] 
                                    tagagcagcaccaatggaac 
Notch3          U97669              Aagttacccccaagaggcaagtgtt     7013-7348 
                                    Aaggaaatgagaggccagaaggaga 
Notch4          U95299              ggctgcccctcccactctcg          3727-4132 
                                    cagcccgggccccaggatag 

Downstream: 
TLE-1           NM_005077           -                             - 
TLE-2           M99436              -                             - 
TLE-3           M99438              -                             - 
TLE-4           M99439              -                             - 
[illegible]     [illegible]         -                             - 
[illegible]     [illegible]         -                             - 
[illegible]     [illegible]         -                             - 
[illegible]     -                   -                             - 
[illegible]     [illegible]         -                             - 
mFringe         [illegible]         -                             - 
lFringe         -                   -                             - 
rFringe         [illegible]         -                             - 
[illegible]     [illegible]         -                             - 
Numb            [illegible]         -                             - 
LNX             [illegible]         -                             - 

Wingless Pathway 

Ligands: 
Wnt1            NM_005430           -                             - 
Wnt2            [illegible]         -                             - 
Wnt2B           [illegible]         [illegible]                   1159-1503 
Wnt5A           -                   -                             - 
Wnt6            [illegible]         -                             - 
Wnt7A           [illegible]         -                             - 
Wnt8B           [illegible]         -                             - 
Wnt10B          [illegible]         -                             - 
Wnt11           [illegible]         -                             - 
Wnt14           [illegible]         -                             - 
Wnt15           [illegible]         -                             - 
Wnt16           AF169963            -                             - 

Receptors: 
FZD1            NM_003505           -                             - 
FZD2            NM_001466           tacccagagcggcctatcattttt      955-1439 
                                    acgaagccggccaggaggaaggac 
FZD3            NM_017412           -                             - 
FZD4            NM_012193           -                             - 
FZD5            NM_003468           -                             - 
FZD6            NM_003506           Tggcctgaggagcttgaatgtgac      607-1026 
                                    Atcgcccagcaaaaatccaatgaa 
FZD7            NM_003507           -                             - 
FZD8            AA481448            -                             - 
FZD9            NM_003508           -                             - 
FZD10           NM_007197           -                             - 
FRZB            NM_001463           -                             - 

Extracellular Effectors: 
SFRP1           NM_003012           -                             - 
SFRP2           AF017986            -                             - 
SFRP4           AF026692            agaggagtggctgcaatgaggtc       877-1178 
                                    gcgcccggctgttttctt 
SFRP5           NM_003015           -                             - 
SK              AB020315            -                             - 
CER1            NM_005454           -                             - 
WIF-1           NM_007191           -                             - 
DVL1            U46461              -                             - 
DVL2            NM_004422           -                             - 
DVL3            NM_004423           -                             - 

Transcription factors: 
Oct4            Z11899              -                             - 
Brachyury       NM_003181           -                             - 
NeuroD1         NM_002500           -                             - 
NeuroD2         NM_006160           -                             - 
NeuroD3         U63842              -                             - 
MyoD            NM_002478           -                             - 
MDFI            NM_005586           -                             - 
[illegible]     [illegible]         -                             - 

Table 5 

Listed are examples of vector systems that are to be used in cells to direct the 
production of stem loop RNA. 

Expression System           Vectors      Accession numbers   Promoters 
Tet-on/Tet-off              pTet-on      U89930              CMV, MyoD1, NeuroD1, Oct4, GATA1, Beta-actin, PGK 
(Clontech, USA)             pTet-off     U89929 
                            pTRE2-Hyg    - 
IRES                        pIRES-EGFP   -                   CMV, MyoD1, NeuroD1, Oct4, GATA1, Beta-actin, PGK 
(Invitrogen, Netherlands) 
Ecdysone                    pIND         -                   CMV, MyoD1, NeuroD1, Oct4, GATA1, Beta-actin, PGK 
(Invitrogen, Netherlands)   pVgRXR       - 

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CLAIMS 



1. A method to modulate the differentiation state of a stem cell comprising: 

(i) [illegible] which nucleic acid molecule consists of a first part [illegible] 
said first and second parts are complementary [illegible] wherein said first and 
second parts form [illegible]; 

(ii) providing conditions conducive to the growth [illegible] treated in (i) above; 
and optionally 

(iii) maintaining and/or storing the cell in a differentiated state. 

2. A method according to Claim 1 wherein said first and second parts are linked 
by at least one nucleotide base. 

3. A method according to Claim 1 or 2 wherein said nucleic acid molecule is a 
stem loop RNA molecule or a nucleic acid molecule encoding a stem loop RNA. 

4. A method according to any of Claims 1-3 wherein said conditions are in vitro 
cell culture conditions. 

5. A method according to any of Claims 1-4 wherein said stem cell is selected 
from the group consisting of: an embryonic stem cell; an embryonic germ cell; an 
embryonal carcinoma cell; a haemopoietic stem cell; [illegible]. 

6. A method according to any of Claims 1-5 wherein said stem cell is an 
embryonic stem cell or embryonic germ cell. 

7. A method according to any of Claims 1-6 wherein said nucleic acid molecule 
is derived from at least one nucleic acid sequence as represented by Figures 4-54. 

8. An RNA molecule derived from a coding sequence of at least one gene 
involved in stem cell differentiation comprising a first part linked to a second part 
wherein said first and second parts are complementary over at least part of their 
length and further wherein said first and second parts form a double stranded region 
by complementary base pairing over at least part of their length. 

9. An RNA molecule according to Claim 8 wherein said first and second parts 
are linked by at least one nucleotide base (nb). 

10. An RNA molecule according to Claim 9 wherein said first and second parts 
are linked by 2, 3, 4, 5, 6, 7, 8, 9, or 10nb in length. 

11. An RNA molecule according to Claim 9 wherein said linker is at least 10nb in 
length. 

12. An RNA molecule according to any of Claims 8-11 wherein the length of the 
RNA molecule is between 10nb and 1000nb in length. 

13. An RNA molecule according to Claim 12 wherein the length of the RNA 
molecule is selected from 10nb; 20nb; 30nb; 40nb; 50nb; 60nb; 70nb; 80nb; 90nb in 
length. 

14. An RNA molecule according to Claim 12 wherein said RNA molecule is 
100nb; 200nb; 300nb; 400nb; 500nb; 600nb; 700nb; 800nb; 900nb; or 1000nb in 
length. 

15. An RNA molecule according to Claim 8 wherein said RNA molecule is at 
least 1000nb in length. 

16. An RNA molecule according to Claim 8 wherein said RNA molecule is 
21nb in length. 

17. An RNA molecule according to any of Claims 8-16 wherein said RNA 
molecule comprises sequences identified in Figures 4-54. 

18. An RNA molecule according to any of Claims 8-17 wherein said RNA 
molecules comprise modified nucleotide bases. 

19. A nucleic acid molecule which encodes an RNA molecule according to any of 
Claims 8-18 wherein said nucleic acid molecule is operably linked to at least one 
further nucleic acid molecule capable of promoting transcription of said nucleic acid 
linked thereto. 

20. A nucleic acid molecule according to Claim 19 wherein said further nucleic 
acid molecule is a promoter capable of inducible transcription. 

21. A vector including a nucleic acid molecule according to Claim 19 or 20. 

22. A cell transfected with an RNA molecule according to any of Claims 8-18, 
nucleic acid molecule according to Claim 19 or 20 or a vector according to Claim 
21. 

23. A cell according to Claim 22 wherein said cell is an embryonic stem cell or 
embryonic germ cell. 

24. A cell according to Claim 22 wherein said cell is an embryonal carcinoma 
cell. 

25. A method to manufacture stem loop RNA molecules comprising: 

(i) providing a nucleic acid molecule according to Claim 19 or 20 or a vector 
according to Claim 21; 

(ii) providing reagents and conditions which allow the synthesis of the RNA 
molecule comprising an RNA molecule according to any of Claims 8-18; and 

(iii) providing conditions which allow the RNA molecule to base pair over at least 
part of its length, or at least that part corresponding to the nucleic acid sequence 
encoding said stem cell gene which mediates stem cell differentiation. 

26. An in vivo method to promote the differentiation of stem cells comprising 
administering to an animal an effective amount of an RNA molecule according to any 
of Claims 8-18, a nucleic acid molecule according to Claim 19 or 20 or a vector 
according to Claim 21, sufficient to effect differentiation of a target stem cell. 

27. An RNA molecule according to any of Claims 8-18, a nucleic acid molecule 
according to Claim 19 or 20 or a vector according to Claim 21 for use as a 
pharmaceutical. 

28. A pharmaceutical composition comprising an RNA molecule according to any 
of Claims 8-18, a nucleic acid molecule according to Claim 19 or 20 or a vector 
according to Claim 21. 

29. Use of an RNA molecule according to any of Claims 8-18, a nucleic acid 
molecule according to Claim 19 or 20 or a vector according to Claim 21 for the 
manufacture of a medicament for use in promoting the differentiation of stem cells to 
provide differentiated cells/tissues to treat diseases where cells/tissues are destroyed 
by said disease. 


30. Use according to Claim 29 wherein said disease is selected from the group 
consisting of: pernicious anemia; stroke; neurodegenerative diseases such as 
Parkinson's disease, Alzheimer's disease; coronary heart disease; cirrhosis; 
diabetes; nerves damaged as a consequence of trauma (e.g. replacement of spinal 
cord tissue). 

31. A cell obtainable by the method according to any of Claims 1-7. 

32. An organ comprising at least one cell according to Claim 31. 


33. A non-human transgenic animal comprising an RNA molecule according to 
any of Claims 8-18, or a nucleic acid molecule according to Claim 19 or 20, or a 
vector according to Claim 21. 

Figure 1 

Figure 2 



AGCAGCTTGGGCTCGAGAAGGATGTGGTCCGAGTGTGGTTCTGTAACCGGCGCCAG 

AAGGGCAAGCGATCAAGCAGCGACTATGCACAACGAGAGGATTTTGAGGCT^GG 

GTCTCCTTTCTCAGGGGGACCAGTGTCCTTTCCTCTGGCCCCAGGGCCCCATTTTGGT 

GCCCCAGGCTATGGGACK:CCTCACTTCACTGCACTGTACTCCTCCK3TCCCTTTCCCTG 

AGGGGGAAGCCTTTCCCCCTGTCTCTGTCACCACTCTGGGCTCTCCCTTGCATTCAAA 

CTGAGGTGCCTGCCTGCCCTTCTAGGAATGGGGGACAGGGGGAGGGGAGGAGCTAG 

GGAAAGAAAACCTGGAGTTTGTGCCAGGGTTTTTGGATTAAGTTCTTCATTCACTAA 

GGAAGGAATTGGGAACACAAAGGG 
3' 
variable linker region 

RNA polymerase promoter 

cgagggtgcccg-agaauggat-(N)n-atccattctcgggcaccctcg 

Direction of Transcription 

B 

5'-cgagggtgcccgagaatggat-(N)n 
3'-(N)n-gctcccacgggctcttacctaa-(N) 



Figure 3 
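
The stem loop layout drawn above (a sequence arm, a variable linker (N)n, then the inverted complement of that arm) can be sketched in a few lines of Python. This is only an illustrative sketch, not part of the disclosure: the function names, the use of the 21-nucleotide arm from panel B, and the eight-base placeholder linker are all assumptions made for the example, and the linker length in particular is not taken from the figure.

```python
# Illustrative sketch only: helper names and the linker length are hypothetical.

# Complement table for DNA; "u" is read as "t" so RNA-style input also works.
COMPLEMENT = str.maketrans("acgtuACGTU", "tgcaaTGCAA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a nucleotide sequence as DNA."""
    return seq.translate(COMPLEMENT)[::-1]


def stem_loop_template(sense: str, linker: str) -> str:
    """Concatenate sense arm, linker, and the reverse complement of the arm.

    A transcript of such a template can fold back on itself, with the two
    arms base pairing and the linker forming the loop.
    """
    return sense + linker + reverse_complement(sense)


# Left arm as drawn in panel B of the diagram.
sense_arm = "cgagggtgcccgagaatggat"

# Hypothetical placeholder for the variable linker (N)n; "n" is left unchanged
# by the complement table, so it stays a placeholder in the output.
linker = "n" * 8

template = stem_loop_template(sense_arm, linker)

print(reverse_complement(sense_arm))  # atccattctcgggcaccctcg
print(template)
```

The reverse complement computed here matches the right-hand arm shown in the diagram, which is a quick consistency check on the arm sequences as extracted.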
Figure 3 

GTCCAGCGGTACCATGGGCCGTCGGAGCGCGCTAGCCCTTGCCGTGGTCTCTGCCCTGCTGTGC 

CAGGTCTGGAGCTCCGGCGTATTTGAGCTGAAGCTGCAGGAGTTCGTCAACAAGAAGGGGCTG 

CTGGGGAACCGCAACTGCTGCCGCGGGGGCTCTGGCCCGCCTTGCGCCTGCAGGACCITCTTTC 

GCGTATGCCTCAAGCACTACCAGGCCAGCGTGTCACCGGAGCCACCCTGCACCTACGGCAGTG 

CTGTCACGCCAGTGCTGGGTGTCGACTCCTTCAGCCTGCCTGATGGCGCAGGCATCGACCCCGC 

CITCAGCAACCCCATCCGATTCCCCTTCGGCTTCACCTGGCCAGGTACCTTCTCTCTGATCATTG 

AAGCCCTCCATACAGACTCTCCCGATGACCTCGCAACAGAAAACCCAGAAAGACTCATCAGCC 

GCCTGACCACACAGAGGCACCTCACTGTGGGAGAAGAATGGTCTCAGGACCTTCACAGTAGCG 

GCCGCACAGACCTCCGGTACTCTTACCGGTTTGTGTGTGACGAGCACTACTACGGAGAAGGTTG 

CTCTGTGTTCTGCCGACCTCGGGATGACGCCTTTGGCCACTTCACCTGCGGGGACAGAGGGGAG 

AAGATGTGCGACCCTGGCTGGAAAGGCCAGTACTGCACTGACCCAATCTGTCTGCCAGGGTGT 

GATGACCAACATGGATACTGTGACAAACCAGGGGAGTGCAAGTGCAGAGTTGGCTGGCAGGGC 

CGCTACTGCGATGAGTGCATCCGATACCCAGGTTGTCTCCATGGCACCTGCCAGCAACCCTGGC 

AGTGTAACrGCCAGGAAGGCTGGGGGGGCCTTTTCTGCAACCAAGACCTGAACTACTGTACTCA 

CCATAAGCCGTGCAGGAATGGAGCCACCTGCACCAACACGGGCCAGGGGAGCTACACATGTTC 

CTGCCGACCTGGGTATACAGGTGCCAACTGTGAGCTGGAAGTAGATGAGTGTGCTCCTAGCCCC 

TGCAAGAAGGGAGCGAGCTGCACGGACCTTGAGGACAGCTTCTCTTGCACCTGCCCTCCCGGCT 

TCTATGGCAAGGTCTGTGAGCTGAGCGCCATGACCTGTGCAGATGGCCCTTGCTTCAATGGAGG 

ACGATGTTCAGATAACCCTGACGGAGGCTACACCTGCCATTGCCCCTTGGGCTTCTCTGGCTTC 

AACrGTGAGAAGAAGATGGATCTCTGCGGCTCTTCCCCTTGTTCTAACGGTGCCAAGTGTGTGG 

ACCTCGGCAACTCTTACCTGTGCCGGTGCCAGGCTGGCTTCTCCGGGAGGTACTGCGAGGACAA 

TGTGGATGACTGTGCCTCCTCCCCGTGTGCAAATGGGGGCACCTGCCGGGACAGTGTGAACGAC 

TTCTCCTGTACCTGCCCACCTGGCTACACGGGCAAGAACTGCAGCGCCCCTGTCAGCAGGTGTG 

AGCATGCACCCTGCCATAATGGGGCCACCTGCCACCAGAGGGGCCAGCGCTACATGTGTGAGT 

GCGCCCAGGGCTATGGCGGCCCCAACTGCCAGTTTCTGCTCCCTGAGCCACCACCAGGGCCCAT 

GGTGGTGGACCTCAGTGAGAGGCATATGGAGAGCCAGGGCGGGCCCTTCCCCTGGGTGGCCGT 

GTGTGCCGGGGTGGTGCTTGTCCTCCTGCTGCTGCTGGGCTGTGCTGCTGTGGTGGTCTGCGTCC 

GGCTGAAGGTACAGAAACACCAGCCTCCACCTGAACCCTGTGGGGGAGAGACAGAAACCATGA 

ACAACCTAGCCAATTGCCAGCGCGAGAAGGACGTTTCTGTTAGCATCATTGGGGCTACCCAGAT 

CAAGAACACCAACAAGAAGGCGGACTTTCACGGGGACCATGGAGCCAAGAAGAGCAGCTTTA 

aggtccgataccccactgtggactataacctcgttcgagacctcaagggagatgaagccacgg 

tcagggatacacacagcaaacgtgacaccaagtgccagtcacagagctctgcaggagaagag 

aagatcgccccaacacttaggggtggggagattcctgacagaaaaaggccagagtctgtctac 

tctacttcaaaggacaccaagtaccagtcggtgtatgttctgtctgcagaaaaggatgagtgtg 

ttatagcgactgaggtgtaagatggaagcgatgtggcaaaattcccatttctctcaaataaaat 

tccaaggatatagccccgatgaatgctgctgagagaggaagggagaggaaacccagggactg 

ctgctgagaaccaggttcaggcgaagctggttctctcagagttagcagaggcgcccgacactg 

ccagcctaggcirrggcrgccgctggactgcctgctggttgttcccattgcactatggacagttg 

ctttgaagagtatatatttaaatggacgagtgacttgattcatataggaagcacgcactgccca 

cacgtctatcirggattactatgagccagtctttccr^ 

TCCTlTTTGATACTGAGATGTGIlllllllTTTCCrAGACGGGAAAAAGAAAACGTGTGTTATTT 

TTTTGGGATTTGTAAAAATATTTTTC^ 

TITITATAATTTAAATTTTGGTAAAT^^ 

TTGTATATAAATGTATTTATGGAATATTGT 

GAAGAAATTCATTTTAAAAATATTTTrCCAAAAtAAATATAATGAACTACA 



Figure 4 

CGGGCAGAGGTGGAAGAGGGGGGAGCGCCTCAAAGAAGCGATCAGAATAATAAAAGGAGGCC 

GGGCTCTTTGCCTTCTGGAACGCGCGGCrCTTGAAAGGGCITITGAAAAGTAGTGTTG 

GTCGTGCATGCTCCAATCCACGGAGTATATTAGAGCCGGGACGCGGCGGCCGCGGGGGCAGCG 

ACGACGGCAGCCTCGGCGGGAGCACCAGCGCTAGCAGCGGCGGCGGCGTCCGGAGTGCCCGTG 

GCGCGCGGCGCAGCGATGCGGTCCCCACGGACGCGCGGCCGGCCCGGGCGCCCCCTGAGTCTT 







CrGCTCGCCCTGCTCTGTGCCCTGCGAGCCAAGGTGTGCGGGGCCTCGGGTCAGTTTGAGCTGG 

AGATCCTGTCCATGCAGAACGTGAATGGAGAGCTACAGAATGGGAACTGTTGTGGTGGAGTCC 

GGAACCCTGGCGACCGCAAGTGCACCCGCGACGAGTGTGATACGTACTTCAAAGTGTGCCTCA 

AGGAGTATCAGTCCCGCGTCACTGCCGGGGGACCCTGCAGCTTCGGCTCAGGGTCTACGCCTGT 

CATCGGGGGTAACACCTTCAATCTCAAGGCCAGCCGTGGCAACGACCGTAATCGCATCGTACTG 

CCITTCAGTITCGCCTGGCCGAGGTCCTAeACTTTGCTGGTGGAGGCCTGGGATTCCAGTAATG 

ACACTATrCAACCTGATAGCATAATTGAAAAGGCTTCTCACTCAGGCATGATAAACCCTAGCCG 

GCAATGGCAGACACTGAAACAAAACACAGGGATTGCCCACTTCGAGTATCAGATCCGAGTGAC 

CTGTGATGACCACTACTATGGCTTTGGCTGCAATAAGTTCTGTCGTCCCAGAGATGACITCTTTG 

GACATTATGCCTGTGACCAGAACGGCAACAAAACrTGCATGGAAGGCTGGATGGGTCCTGATT 

GCAACAAAGCTATCTGCCGACAGGGCTGCAGTCCCAAGCATGGGTCTTGTAAACTrCCAGGTG 

ACTGCAGGTGCCAGTACGGTTGGCAGGGCCTGTACTGCGACAAGTGCATCCCGCACCCAGGAT 

GTGTCCACGGCACCTGCAATGAACCCTGGCAGTGCCTCTGTGAGACCAACTGGGGTGGACAGC 

TCTGTGACAAAGATCTGAATTACTGTGGGACTCATCAGCCCTGTCTCAACCGGGGAACATGTAG 

CAACACTGGGCCTGACAAATACCAGTGCTCCTGCCCAGAGGGCTACTCGGGCCCCAACTGTGA 

AATTGCTGAGCATGCTTGTCTCTCTGACCCCTGCCATAACCGAGGCAGCTGCAAGGAGACCTCC 

TCAGGCTTTGAGTGTGAGTGTTCTCCAGGCTGGACTGGCCCCACGTGTTGCACAAACATCGATG 

ACTGTTCTCCAAATAACTGTTCCCATGGGGGCACCTGCCAGGATCTGGTGAATGGATTCAAGTG 

TGTGTGCCCGCCCCAGTGGACTGGCAAGACTTGTCAGTTAGATGCAAATGAGTGCGAGGCCAA 

ACCITGTGTAAATGCCAGATCCrGTAAGAATCTGATTGCCAGCTACrACTGTGATTGCCTTCCTG 

GCTGGATGGGTCAGAACTGTGACATAAATATCAATGACTGCCTTGGCCAGTGTCAGAATGACG 

CCTCCTGTCGGGATTTGGTTAATGGTTATCGCTGTATCTGTCCACCTGGCTATGCAGGCGATCAC 

TGTGAGAGAGACATCGATGAGTGTGCTAGCAACCCCTGCTTGAATGGGGGTCACTGTCAGAAT 

GAAATCAACAGATTCCAGTGTCrcrGTCCCACTGGTTTCTCTGGAAACCTCTGTCAGCTGGACA 

TCGATTACTGCGAGCCCAACCCTTGCCAGAATGGCGCCCAGTGCTACAATCGTGCCAGTGACTA 

TTTCTGCAAGTGCCCCGAGGACTATGAGGGCAAGAACTGCTCACACCTGAAAGACCACTGGCG 

TACCACCACCTGCGAAGTGATTGACAGCTGCACTGTGGCCATGGCCTCCAACGACACGCCTGAA 

GGGGTGCGGTATATCTCTTCTAACGTCTGTGGTCCCCATGGGAAGTGCAAGAGCCAGTCGGGAG 

GCAAATTCACCTGTGACTGTAACAAAGGCTTCACCGGCACCTACTGCCATGAAAATATCAACGA 

CTGCGAGAGCAACCCCTGTAAAAACGGTGGCACCTGCATCGATGGCGTTAACTCCrACAAGTGT 

ATCTGTAGTGACGGCTGGGAGGGAGCGCACTGTGAGAACAA CATA AATGACTGTAGCCAGAAC 

CCTTGTCACTACGGGGGTACATGTCGAGACCTGGTCAATGACTlnrTACrGTGAcrGCAAAAATG 

GCTGGAAAGGAAAGACTTGCCATTCCCGTGACAGCCAGTGTGACGAAGCCACGTGTAATAATG 

GTGGTACCTGCTATGATGAAGTGGACACGTTTAAGTGCATGTGTCCCGGTGGCTGGGAAGGAA 

CAACTTGfAATATAGCTAGAAACAGTAGCTGCCTGCCGAACCCCTGTCATAATGGAGGTACCTG 

CGTGGTCAATGGAGACTCCTTCACCTGTGTCrGCAAAGAAGGCTGGGAGGGGCCTATTTGTACT 

caaaataccaacgactgcagtccccatccttgttacaatagcgggacctgtgtggacgGagac 

aacrggtatcggtgcgaatgtgccccgggttttgctgggccagactgcaggataaacatgaatg 

agtgccagtcttccccttgtgcctttggggccacctgtgtggatgagatcaatggctaccagtg 

tatctgccctccaggacatagtggtgccaagtgccatgaagtttcagggcgatcttgcatcacc 

atggggagagtgatacttgatggggccaagtgggatgatgactgtaacacctgccagtgcctg 

aatggacgggtggcctgctccaaggtctggtgtggcccgagaccttgcaggctccacaaaagc 

cacaatgagtgccccagtgggcagagctgcatcccggtcctggatgaccagtgtttcgtgcgcc 

cctgcactggtgttggcgaatgtcggtcctccagcctccagccagtgaagaccaagtgcacatc 

tgactcctattaccaggataactgtgcaaacatcacittcacctttaacaaagagatgatgtct 

ccaggtcitaccaccgaacacatttgcagcgaattgaggaatttgaatatcctgaagaatgttt 

crgctgaatattcgatctacatagcctgtgagccttccctgtcagcaaacaatgaaatacacgt 

ggccatctctgcagaagacatccgggatgatgggaaccctgtcaaggaaattaccgataaaat 

aatagatctcgttagtaaacgggatggaaacagctcacttattgctgcggttgcagaagtcag 

agttcagaggcgtcctcrgaaaaacagaacagatttcctggttcctctgctgagctctgtcita 

acagtggcttgggtctgttgcttggtgacagccttctactggtgtgtacggaagcggcggaagc 

ccagcagccacactcactccgcccccgaggacaacaccaccaacaatgtgcgggagcagctga 

accaaatcaaaaaccccatcgagaaacacggagccaacacggtccccattaaggattacgaga 

acaaaaactccaaaatgtcaaaaatcaggacacacaactcggaagtggaggaggatgacatg 

gataaacaccagcagaaagtccgctttgccaaacagccagtgtatacgctggtagacagagag 

gagaaggcccccagcggcacgccgacaaaacacccgaactggacaaataaacaggacaacag 
AGACTTGGAAAGTGCCCAGAGCTTGAACCGGATGGAATACATCGTATAGCAGACAGTGGGCTG 
CCGCCATAGGTAGAGTTTGAGGGCACCGCGGGCCG 



Figure 5 

CTGCGGCCGGCCCGCGAGCTAGGCTGGGTTTTTTTTTTTCTCCCCT 

TGATCTAAAAGGGAATAAAAGGCTGCGCATAATCATAATAATAAAAGAAGGGGAGCGCGAGAGAAGGA 

GAAAGCCGGGAGGTGGAAGAGGAGGGGGAGCGTCTCAAAGAAGCGATCAGAATAATAAAAGGAGGCGG 

CTCTTTGCCTTCTGGAACGGGCCGCTCTTGAAAGGGCTTTTGA 

TGCTCCAATCGGCGGAGTATATTAGAGCCGGGACGCGGCGGCCGCAGGGGCAGCGGCGACGGCAGCACG 
GCGGCAGCACCAGCGCGAACAGCAGCGGCGGCGTCCCGAGTGCCCGCGGCGCGCGGCGCAGCGATGCGT 
CCCCACGGACGCGCGGCCGGTCCGGGCGCCCCCTAAGCCTCCTGCTCGCCCTGCTCTGTGCCCTGCGAGC 

CAAGGTGTGTGGGGCCTCGGGTCAGTTCGAGTTGGAGATCCTGTCCATGCAGAACGTGAACGGGGAGCTG 

CAGAACGGGAACTGCTGCGGCGGCGCCCGGAACCCGGGAGACCGCAAGTGCACCCGCGACGAGTGTGAA 

CATACTTCAAAGTGTGCCTCAAGGAGTATCAGTCCCGCGTCACGGCCGGGGGGCCCTGCAGCTTCGGCTC 

AGGGTCCACGCCTGTCATCGGGGGCAACACCTTCAACCTCAAGGCCAGCCGCGGCAACGACCGCAACCC 

ATCGTGCTGCCTTTCAGTTTCGCCTGGCCGAGGTCCTATACGTTGCTTGTGGAGGCGTGGGATTCCAGTA 

ATGACACCGTTCAACCTGACAGTATTATTGAAAAGGCTTCTCACTCGGGCATGATCAACCCCAGCCGGCA 

GTGGCAGACGCTGAAGCAGAACACGGGCGTTGCCCACTTTGAGTATCAGATCCGCGTGACCTGTGATGAC 

TACTACTATGGCTTTGGCTGCAATAAGTTCTGCCGCCCCAGAGATGACTTCTTTGGACACTATGCCTGTG 

ACCAGAATGGCAACAAAACITGCATGGAAGGCTGGATGGGCCCCGAATGTAACAGAGCTATTTGCCGAA 

AGGCTGCAGTCCTAAGCATGGGTCTTGCAAACTCCCAGGTGACTGCAGGTGCCAGTATGGCTGGCAAGGC 

CTGTACTGTGATAAGTGCATCCCACACCCGGGATGCGTCCACGGCATCTGTAATGAGCCCTGGCAGTGCC 

TCTGTGAGACCAACTGGGGCGGCCAGCTCTGTGACAAAGATCTCAATTACTGTGGGACTCATCAGCCGTG 

TCTCAACGGGGGAACTTGTAGCAACACAGGCCCTGACAAATATCAGTGTTCCTGCCCTGAGGGGTATTCA 

GGACCCAACTGTGAAATTGCTGAGCACGCCTGCCTCTCTGATCCCTGTCACAACAGAGGCAGCTGTAAGG 

AGACXJrCCCTGGGCTTTGAGTGTGAGTGTrCCCCAGGCTGGACCGGCCCCACATGCTCTACAAACATTGA 

TGACTGTTCTCCTAATAACTGTTCCCACGGGGGCACCTGCCAGGACCTGGTTAACGGATTTAAGTGTGTG 

TGCCCCCCACAGTGGACTGGGAAAACGTGCCAGTTAGATGCAAATGAATGTGAGGCCAAACCTTGTGTAA 

ACGCCAAATCCTGTAAGAATCrCATTGCCAGCTACTACTGCGACTGTCTTCCCGGCTGGATGGGTCAGAA 

TTGTGACATAAATATTAATGACTGCCTTGGCCAGTGTCAGAATGACGCCTCCTGTCGGGATTTGGTTAAT 

GGTTATCGCTGTATCTGTCCACCTGGCTATGCAGGCGATCACTGTGAGAGAGACATCGATGAATGTGCCA 

GCAACCCCTGTTTGAATGGGGGTCACTGTCAGAATGAAATCAACAGATTCCAGTGTCTGTGTCCCACTGG 

TTTCTCTGGAAACCTCTGTCAGCTGGACATCGATTATTGTGAGCCTAATCCCTGCCAGAACGGTGCCCAG 

TGCTACAACCGTGCCAGTGACTATTTCTGCAAGTGCCCCGAGGACTATGAGGGCAAGAACTGCTCACACC 

TGAAAGACCACTGCCGCACGACCCCCTGTGAAGTGATTGACAGCTGCACAGTGGCCATGGCTTCCAACGA 

CACACCTGAAGGGGTGCGGTATATTTCCTCCAACGTCTGTGGTCCTCACGGGAAGTGCAAGAGTCAGTCG 

GGAGGCAAATTCACCTGTGACTGTAACAAAGGCTTCACGGGAACATACTGCCATGAAAATATTAATGACT 

GTGAGAGCAACCCTTGTAGAAACGGTGGCACITGCATCGATGGTGTCAACTCCTACAAGTGCATCTGTAG 

TGACGGCTGGGAGGGGGCCTACTGTGAAACCAATATTAATGACTGCAGCCAGAACCCCTGCCACAATGG 

GGCACGTGTCGCGACCTGGTCAATGACTTCTACTGTGACTGTAAAAATGGGTGGAAAGGAAAGACCTGCC 

ACTCACGTGACAGTCAGTGTGATGAGGCCACGTGCAACAACGGTGGCACCTGCTATGATGAGGGGGATC 

TTTTAAGTGCATGTGTCCTGGCGGCTGGGAAGGAACAACCTGTAACATAGCCCGAAACAGTAGCTGCCTG 

CCCAACCCCTGCCATAATGGGGGCACATGTGTGGTCAACGGCGAGTCCTTTACGTGCGTCTGCAAGGAAG 

GCTGGGAGGGGCCCATCTGTGCTCAGAATACCAATGACTGCAGCCCTCATCCCTGTTACAACAGCGGCAC 

CTGTGTGGATGGAGACAACTGGTACCGGTGCGAATGTGCCCCGGGTTTTGCTGGGCCCGACTGCAGAATA 

AACATCAATGAATGCCAGTCTTCACCTTGTGCCTTTGGAGCGACCTGTGTGGATGAGATCAATGGCTACC 

GGTGTGTCTGCCCTCCAGGGCACAGTGGTGCCAAGTGCCAGGAAGTTTCAGGGAGACCTTGCATCACCAT 

GGGGAGTGTGATACCAGATGGGGCCAAATGGGATGATGACTGTAATACCTGCCAGTGCCTGAATGGACG 

ATCGCCTGCTCAAAGGTCTGGTGTGGCCCTCGACCTTGCCTGCTCCACAAAGGGCACAGCGAGTGCCCCA 

GCGGGCAGAGCTGCATCCCCATCCTGGACGACCAGTGCTTCGTCCACCCCTGCACTGGTGTGGGCGAGTG 

TCGGTCTTCCAGTCTCCAGCCGGTGAAGACAAAGTGCACCrCTGACTCCrATTACCAGGATAACTGTGCG 

AACATCACATTTACCITrAACAAGGAGATGATGTCACCAGGTCTTACTACGGAGCACATTTGCAGTGAAT 

TGAGGAATTTGAATATTTTGAAGAATGTTTCCGCTGAATATTCAATCTACATCGCIT 

TTCAGCGAACAATGAAATACATGTGGCCATTTCTGCTGAAGATATACGGGATGATGGGAACCCGATCAAG 

GAAATCACTGACAAAATAATCGATCTTGTTAGTAAACGTGATGGAAACAGCTCGCTGATTGCTGCCGTTG 

CAGAAGTAAGAGTTCAGAGGCGGCCTCTGAAGAACAGAACAGATTTCCTTGTTCCCTTGCTGAGCTCTGT 

CTTAACTGTGGCTTGGATCTGTTGCTTGGTGACGGCCTTCTACTGGTGCCTGCGGAAGCGGCGGAAGCCG 

GGCAGCCACACACACTCAGCCTCTGAGGACAACACCACCAACAACGTGCGGGAGCAGCTGAACCAGATA 
aAAACCCCaTTGAGAAACATGGGGCCAACACGG 

GTCTAAAATAAGGACAGACAATTCTX3AAGTAGAAGAGGACGACATGGA 

GTTTGCCAAGCAGCCGGCGTATACGCTGGTAGACAGAGAAGAGAAGCCCCCCAACGGCACGCCGACAAC 
ACCCAAACTGGACAAACAAACAGGACAACAGAGACITGGAAAGTGCCCAGAGCTTA^ 
CATCGTATAGCAGACCGCGGGCACTGCCGCCGCTAGGTAGAGTCTGAGGGCTC^^ 

^ _ i _ _ 1: i „1 . . ^^^^^ . ™~ a /*^ a a TrprmTfiTT A ATTTA A GTTTTGACAAGCTG 



GCTTACACTGGCAATGGTAGTTTCTGTGGTTGGCTGGGAAATCXjAGTGCCGCATCTCACA 
AAGCTAGTCAACAGTACCCTGGTTGTGTGTCCCCTTGCAGCCGACACGGTCTCGGATCAGGCTCCCAGGA 
GCCTGCCCAGCCCCCTGGTCTTTGAGCTCCCACTTCTGCCAGATGTCCT 
CATAGTTTTATTTATATTTATTGACTCTTGA 

GTTCTGTATTTGAAAGTGCCTTTGCAGCTCAGAACCACAGCAACGATCAC^ 
TTTTTAATTGTATTTTTGTTGTTGGGGGAGGGGAGACTTTC 

TITAAAGAAAAAAATGTCAAAAGTAGAACTTTGTATAGTTATGTAAATAATTC1"1'1'1'1 lATTAATCACTG 
TGTATATTTGATTTATTAACTITAATAATCAA 

GTTTAGAATTGAAGGTTTTTGATAGCATTGTAAGCGTATGGCTT^ 

TGTTGCCTATAAGCCAAAATTAAGGTGTTTGAAAATAGTTTATTTTAAAACAATAGGATGGGCTTCT 

CCCAGAATACTGATGGAATTITIT1TGTACGACGTCAGATGT1TAAAA 

AACACGTTTTAAGGACTGACTGAGGCAGTTTGAGGATTAGTTTAGAACAGGTTTTTTT 

TTTGTTTTTCTGCTTTAGACTTGAAAAGAGACAGGCAGGTGATCTGCTGCAGAGCAGTAAGGGAACAAGT 

TGAGCTATGACTTAACATAGCCAAAATGTGAGTGGTTGAATATGATTAAAAATATCAAATTAATrGTGTG 

AACTTGGAAGCACACCAATCTGAC1TTGTAAATTCTGA 

CACITGTAGATTTGATTTTTTI 1 1 l AATCTACTGCATTTAGGGAGTATTCTAATAAGCTAGTTGAATACT 
TGAACCATAAAATGTCCAGTAAGATCACTGTTTAGATTTGCCATAGAGTACACTGCCTGCCTTAAGTGAG 
GAAATCAAAGTGCTATTACGAAGTTCAAG ATCAAAAAG GCTTATAAAACAGA GTAATCTTGTTGGTTCAC 
CATTGAGACCGTGAAGATACTTTGTATTGTCCTATTAGTGTTATATGAACATACAAATGCATCTTTGATG 

TGTTGTTCTTGGCAATAAATTITGAAAAGTAATATTTATTAAATTTTTT^ 
GTGGCTCTTCTGAGCITACGTAGTTCTACCGGCTTTGCCGTCT 

TGGTAATCGGGGTATAATAGGCTCTGCCTGACAGAGGGATGGAGGAAGAACTGAAAGGCTTTTCAACCC 

AAAACTCATCTGGAGTTCTCAAAGACCTGGGGCTGCTGTGAAGCTGGAACTGCGGGAGCCCCATCTAGGG 

GAGCCTTGATTCCCTTGTTATTCAACAGCAAGTGTGAATACTGCTTGAATAAACACCACTGGATTAATGG 

AAAAAAAAAAAAAAAA 



Figure 6 

GGAGCGGGCGCGCGGCGGCGGCGGGGCCGCGGCGGGCGGGTCGCGGGGGCAATGCGGGCGCAGGGCCG 

GGGCCrrCCCCCCGGCGCTGCTGCTGCTGCTGGCGCTCTGGGTGCAGGCGGCGCGGCCCATGGGCrATTT 

CGAGCTGCAGCTGAGCGCGCTGCGGAACGTGAACGGGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGC 

GGCCGGACAACGCGCGCGGGGGGCTGCGGCCACGACGAGTGCGACACGTACGTGCGCGTGTGCCTTAAG 

AGTACCAGGCCAAGGTGACGCCCACGGGGCCCTGCAGCTACGGCCACGGCGCCACGCCCGTGCTGGGCG 

CAACTCCTTCTACCTGCCGCCGGCGGGCGCTGCGGGGGACCGAGCGCGCGCGCGGCCCCGGGCCGGCGC 

GACCAGGACCCGGGCTTCGTCGTCATCCCCTrCCAGTTCGCCTGGCCGCGCTCCTTTACCCTCATCGTGG 

AGGCCTGGGACTGGGACAACGATACCACCCCGAATGAGGAGCTGCTGATCGAGCGAGTGTCGCATGCCG 

CATGATCAACCCGGAGGACCGCTGGAAGAGCCTGCACTTCAGCGGCCACGTGGCGCACCTGGAGCTGCG 

ATCCGCGTGCGCTGCGACGAGAACTACTACAGCGCCACTTGCAACAAGTTCTGCCGGCCCCGCAACGACT 

TTTTCGGCCACTACACCTGCGACCAGTACGGCAACAAGGCCTGCATGGACGGCTGGATGGGCAAGGAGTG 

CAAGGAAGCTGTGTGTAAACAAGGGTGTAATTTGCTCCACGGGGGATGCACCGTGCCTGGGGAGTGCAG 

TGCAGCTACGGCTGGCAAGGGAGGTTCTGCGATGAGTGTGTCCCCTACCCCGGCTGCGTGCATGGCAGTT 

GTGTGGAGCCCTGGCAGTGCAACTGTGAGACCAACTGGGGCGGCCTGCTCTGTGACAAAGACCTGAACTA 

CTGTGGCAGCCACCACCCCTGCACCAACGGAGGCACGTGCATCAACGCCGAGCCTGACCAGTACCGCTGC 

ACCTGCCCTGACGGCTACTCGGGCAGGAACTGTGAGAAGGCTGAGCACGCCTGCACCTCCAACCCGTGTG 

CCAACGGGGGCTCTTGCCATGAGGTGCCGTCCGGCTTCGAATGCCACTGCCCATCGGGCTGGAGCGGGCC 

CACCTGTGCCCTTGACATCGATGAGTGTGCTTCGAACCCGTGTGCGGCCGGTGGCACCTGTGTGGACCAG 

GTGGACGGCTTTGAGTGCATCTGCCCCGAGCAGTGGGTGGGGGCCACCTGCCAGCTGGACGTCAACGACT 

GTGAAGGGAAGCCATGCCTTAACGCTTTTTCTTGCAAAAACCTGATTGGCGGCTATTACTGTGATTGCAT 

CCCGGGCTGGAAGGGCATCAACTGCCATATCAACGTCAACGACTGTCGCGGGCAGTGTCAGCATGGGGC 

ACCTGCAAGGACCTGGTGAACGGGTACCAGTGTGTGTGCCCACGGGGCTTCGGAGGCCGGCATTGCGAGC 

TGGAACGAGACAAGTGTGCCAGCAGCCCCTGCCACAGCGGCGGCCTCTGCGAGGACCTGGCCGACGGCT 

CCACTGCCACTGCCCCCAGGGCTTCTCCGGGCCTCTCTGTGAGGTGGATGTCGACCTTTGTGAGCCAAGC 

CCCTGCCGGAACGGCGCTCGCTGCTATAACCTGGAGGGTGACTATTACTGCGCCTGCCCTGATGACTxTG 

GTGGCAAGAACTGCTCCGTGCCCCGCGAGCCGTGCCCTGGCGGGGCCTGCAGAGTGATCGATGGCTGCGG 
GTCAGACGCGGGGCCTGGGATGCCTGGCACAGCAGCCTCCGGCGTGTGTGGCCCCCATGGACGCTGCGTC 

AGCCAGCCAGGGGGCAACTTTTCCTGCATCTGTGACAGTGGCrTTACTGGCACCTACTGCCATGAGAACA 

TTGACGACTGCCTGGGCCAGCCCTGCCGCAATGGGGGCACATGCATCGATGAGGTGGACGCCTTCCGCTG 

CTTCTGCCCCAGCGGCTGGGAGGGCGAGCTCTGCGACACCAATCCCAACGACTGCCTTCCCGATCCCTGC 

CACAGCCGCGGCCGCTGCTACGACCTGGTCAATGACTTCTACTGTGCGTGCGACGACGGCTGGAAGGGCA 

AGACCTGCCACTCACGCGAGTTCCAGTGCGATGCCTACACCTGCAGCAACGGTGGCACCTGCTACGACAG 

CGGCGACACCTTCCGCTGCGCCTGCCCCCCCGGCTGGAAGGGCAGCACCTGCGCCGTCGCCAAGAACAGC 

AGCTGCCTGCCCAACCCCTGTGTGAATGGTGGCACCTGCGTGGGCAGCGGGGCCTCCTTCTCCTGCATCT 

GCCGGGACGGCTGGGAGGGTCGTACrTGCACTCACAATACCAACGACTGCAACCCTCTGCCTTGCTACAA 

TGGTGGCATCTGTGTTGACGGCGTCAACTGGTTCCGCTGCGAGTGTGCACCTGGCTTCGCGGGGCCTGAC 

TGCCGCATCAACATCGACGAGTGCCAGTCCTCGCCCTGTGCCTACGGGGCCACGTGTGTGGATGAGATCA 

ACGGGTATCGCTGTAGCTGCCCACCCGGCCGAGCCGGCCCCCGGTGCCAGGAAGTGATCGGGTTCGGGAG 

ATCCTGCTGGTCCCGGGGCACTCCGTTCCCACACGGAAGCTCCTGGGTGGAAGACTGCAACAGCTGCCGC 

TGCCTGGATGGCCGCCGTGACTGCAGCAAGGTGTGGTGCGGATGGAAGCCTTGTCTGCTGGCCGGCCAGC 

CCGAGGCCCTGAGCGCCCAGTGCCCACTGGGGCAAAGGTGCCTGGAGAAGGCCCCAGGCCAGTGTCTGG 

ACCACCCTGTGAGGCCTGGGGGGAGTGCGGCGCAGAAGAGCCACCGAGCACCCCCTGCCTGCCACGCTC 

GGCCACCrGGACAATAACTGTGCCCGCCTCACCTTGCATTTCAACCGTGACCACGTGCCCCAGGGCACCA 

CGGTGGGCGCCATTTGCTCCGGGATCCGCTCCCTGCCAGCCACAAGGGCTGTGGCACGGGACCGCCTGCT 

GGTGTTGCTITGCGACCGGGCGTCCTCGGGGGCCAGTGCCGTGGAGGTGGCCGTGTCCTTCAGCCCTGCC 

AGGGACCTGCCTGACAGCAGCCTGATCCAGGGCGCGGCCCACGCCATCGTGGCCGCCATCACCCAGCGG 

GGAACAGCTCACTGCTCCTGGCTGTCACCGAGGTCAAGGTGGAGACGGTTGTTACGGGCGGCTCTTCCAC 

AGGTCTGCTGGTGCCTGTGCTGTGTGGTGCCTTCAGCGTGCTGTGGCTGGCGTGCGTGGTGCTGTGCGTG 

TGGTGGACACGCAAGCGCAGGAAAGAGCGGGAGAGGAGCCGGCTGCCGCGGGAGGAGAGCGCCAACAC 

AGTGGGCCCCGCTCAACCCCATCCGCAACCCCATCGAGCGGCCGGGGfGGCCACAAGGACGTGCTCTACCA 

GTGCAAGAACTTCACGCCGCCGCCGCGCAGGGCGGACGAGGCGCTGCCCGGGCCGGCCGGCCACGCGGC 

GTCAGGGAGGATGAGGAGGACGAGGATCTGGGCCGCGGTGAGGAGGACTCCCTGGAGGCGGAGAAGTTC 

TCTCACACAAATTCACCAAAGATCCTGGCCGCTCGCCGGGGAGGCCGGCCCACTGGGCCTCAGGCCCCAA 

AGTGGACAACCGCGCGGTCAGGAGCATCAATGAGGCCCGCTACGCCGGCAAGGAGTAGGGGCGGCTGCG 

CTGGGCCGGGACCCAGGGCCCTCGGTGGGAGCCATGCCGTCTGCCGGACCC GGAG CCGAGGCATGTGCT 

AGTTTCTTTATTTTGTGTAAAAAAACCACCAAAAACAAAAACCAAATGTT^ 

CCTTGTATAAATTATTCAGTAACTGTCAGGCTGAAAACAATGGAGTATTCTCGGATAGTTGCTAlllll'G 

TAAAGTTTCCGTGCGTGGCACrCGCTGTATGAAAGGAGAGAGCAAAGGGTGTCTGCGTCGTCACCAAATC 

GTAGCGTTTGTTACCAGAGGTTGTGCACTGTTTACAGAATCITCCITITATTCCT 

GTGGCTCCAGGCCAAAGTGCCGGTGAGACCCATGGCTGTGTTGGTGTGGCCCATGGCTGTTGGTGGGACC 

CGTGGCTGATGGTGTGGCCTGTGGCTGTCGGTGGGACTCGTGGCTGTCAATGGGACCTGTGGCTGTCGGT 

GGGACCTACGGTGGTCGGTGGGACCCTGGTTATTGATGTGGCCCTGGCTGCCGGCACGGCCCGTGGCTGT 

TGACGCACCTGTGGTTGTTAGTGGGGCCTGAGGTCATCGGCGTGCCCAAGGCCGGCAGGTCAACCTCGCG 

CrTGCTGGCCAGTCCACCCTGCCTGCCGTCTGTGCTTCCXCCTGCCCAGAACGCCCGCTCCAGCGATCTC 

TCCACTGTGCXITCAGAAGTGCCCTTCCTGCTGCGCAGCT^ 

GCTCGTGACAAGTGCCITCACACAGACCCCrCGCAACrGTCCACGCGTGCCGTGGCACCAGGCGCTGCCC 

ACCTGCCGGCCCCGGCCGCCCCrCCTCGTGAAAGTGCATTTTTGTAAATGTGTACATATTAAAGGAAGCA 
CTCTGTATATTTGATTGAATAATGCCACCAAAAAAAAAAAAAAAAAAAAATTCCTGCCC 



Figure 7 

TCGAGGCGGCGATGCGGGCACGCGGCTGGGGACGCCTGCCTCGGCGGCTGCTGCTGCTACTGG 

TTCTGTGCGTGCAGGCGACGCGGCCCATGGGCTATTTCGAGCTGCAGCTGAGCGCGCTGCGGAA 

CGTGAACGGGGAGCTGCTGAGCGGCGCCTGCTGTGACGGCGACGGCCGGACGACGCGCGCGGG 

GGGCTGCGGCCGCGACGAGTGCGACACGTACGTGCGCGTGTGCCTTAAGGAGTACCAGGCCAA 

GGTGACGCCCACGGGGCCCTGCAGCTACGGCTACGGCGCCACGCCCGTGCTGGGTGGCAACTC 

CTTCTACCTGCCGCCGGCGGGCGCTGCGGGGGACCGAGCGCGCGCGCGGTCTCGGACCGGCGG 

CCACCAGGACCCGGGCCTCGTCGTCATTCCCTTTCAGTTCGCCTGGCCGCGTTCTTTCACCCTCA 

TCGTGGAGGCCTGGGACTGGGACAATGACACCACTCCAGATGAGGAGCTGCTGATTGAGCGGG 

TGTCGCACGCTGGCATGATCAACCCCGAGGACCGCTGGAAGAGCCTGCACTTCAGCGGCCACG 

TGGCACACCTGGAGCTGCAGATCCGAGTGCGCTGTGATGAGAACTACTACAGTGCCACCTGCA 

ACAAGTTCTGCCGGCCCCGCAACGACTTCTTTGGCCACTATACCTGCGACCAGTACGGCAACAA 

GGCCTGCATGGATGGCTGGATGGGCAAAGAATGCAAAGAAGCCGTGTGTAAACAAGGATGTAA 

TTTGCTCCACGGGGGATGCACTGTGCCTGGGGAGTGCAGGTGCAGCTACGGCTGGCAGGGCAA 
GTTCTGTGACGAGTGTGTCCCCTACCCTGGCTGCGTGCATGGCAGCTGTGTGGAGCCCTGGCAC 

tgtgacWgagaccaactggggtggcctgctctgcgacaaagacctgaactactgtggcagc 

CACCACCCCTGTGTCAACGGGGGTACCTGCATCAATGCTGAGCCTGACCAATACCTCTGCGCCT 
GCCCAGATGG^^ 

GTGCCAATGGGGGCTCTTGCCACGAAGTGCCATCTGGCTTTGAATGCCACTGTCCGTCAGGATG 
GAGCGGACCCAGCTGTGCGCTCGACATTGATGAGTGTGCCTCT^ 

ACCTGCGTGGATCAGGTGGACGGCTTCGAGTGCATCTGCCCGGAGCAGTGGGTGGGGGCTACT 
TGCCAGCTGGACGCCAATGAGTGTGAAGGGAAGCCGTGCCTTAATGCTTTITCTTGCAAAAACC 

TGATTGGCGGCTATTACTTGTGATTGCCT 

CAACGATTGTCATGGGCAGTGTCAGCATGGGGGCACCTGCAAGGACCTGGTCAATGGGTACCA 
GTCTGTGTGCCCGCGGGGCTITGGAGGTCGCCATTGCGAACTAGAGTACGACAAGTGTGCCAG 
CAGCCCCTGCCGCCGGGGTGGCATCTGCGAGGACCTGGTGGATGGCTTCCGCTGCCACTGCCCA 
CGGGGCCTCTCTGGGCTGCACrGTGAGGTGGACATGGATCTCTGTGAACCAAGCCCCTGCCTCA 

ACGGTGCTCGCTGCTACAACOT 

CAAGAACTGCTCAGTGCCCAGGGACACATGCCCTGGCGGGGCATGTAGAGTGATCGATGGCTG 

CGGGTTCGAGGCAGGGTCCAGGGCACGCGGTGTCGCACCCTCTGGTATATGTGGCCCTCACGG 

GCACTGCGTTAGCCTGCCTGGGGGAAACTTCTCCTGCATCTGTGACAGCGGCTTCACAGGCACC 

TACTGCCATGAAAACATTGACGACTGCATGGGCCAGCCCTGCCGCAACGGGGGCACGTGCATT 

GACGAAGTGGACTCCTrCCGCTGCTTCTGCCCCAGTGGCTGGGAAGGAGAACTCTGTGACATCA 

ATCCCAACGACTGCCTCCCCGACCCCTGCCACAGCCGCGGCCGCTGCTATGACCTGGTCAATGA 

CTTCTACTGTGCCTGTGACGATGGCTGGAAGGGCAAGACCTGCCACTCACGCGAGTTCCAGTGT 

GACGCCTACACCTGCAGCAACGGTGGCACATGCTATGACAGCGGCGACACCTTCCGCTGCGCG 

TGCCCTCCGGGCTGGAAGGGCAGCACCTGCACCATCGCCAAGAACAGCAGCTGTGTGCCCAAT 

CCCTGTGTGAATGGAGGCACCTGCGTGGGTAGCGOAGACTCTTTCTCCTGCATCTGCCGGGATG 

GCTGGGAGGGCCGCACCTGCACACATAACACCAATGACTGCAACCCTCTGCCCTGCTATAACG 

GAGGCATCTGTGTTGATGGCGTCAACTGGTTCCGCTGCGAGTGTGCGCCTGGCTTTGCGGGTCC 

TGACTGCCGTATCAACATTGATGAGTGCCAGTCCTCGCCCTGTGCCTACGGAGCCACGTGTGTG 

GATGAGATCAACGGGTAGCGCTGCAGCTGCCCACCAGGTCGTTCTGGCCCCAGGTGCCAGGAA 

GTGGTCATATTCACGAGGCCCTGCTGGTCCCGGGGAATGTCCTTCCCGCATGGGAGTTCCTGGA 

TGGAAGACTGCAACAGCTGCCGCTGCCTGGATGGCCACCGGGATTGTAGCAAGGTATGGTGCG 

GATGGAAGCCTTGCCTGCTCTCTGGTCAGCCCAGCGATCCGAGTGCCCAGTGCCCCCCAGGGCA 

GCAATGTCAGGAGAAGGCCGTGGGTCAGTGCTTGC AGCCACCCTGTGAGAACTGGGGGGAGTG 

TACAGCGGAGGAGCCTCTGCCACCCAGCACCCCCTGTCAGCCACGGAGCAGTCATTTGGACAA 

CAACTGTGCCCGACTCACACTGCGCTTCAACCGTGATCAAGTGCCTCAGGGCACCACCGTGGGC 

GCTATCTGCTCTGGAATCCGAGCCTTGCCTGCCACGAGGGCGGCGGCACACGACCGCCTCCTCC 

TGCTGCTTTGTGATCGAGCATCCTC 

TGCAAGGGACCTGCCTGACAGCAGCCTGATCCAGAGCACAGCCCACGCCATCGTGGCTGCTAT 

CACTCAGAGAGGAAATAGCTCACTGCTGCTGGCTGTCACCGAGGTCAAGGTGGAAACAGTTGT 

TATGGGTGGCTCTTCCACAGGTCTGTTGGTGCCCGTGCTGTGCAGCGTGTTCAGTGTGCTGTGGC 

TCGCCTGTGTGGTTATCTGCGTATGGTGGACACGAAAGCGCAGGAAAGAACGTGAGAGGAGCC 

GGCTACCACGGGATGAGAGCACCAACAACCAGTGGGCCCCGCTCAATCCCATCCGCAACCCCA 

TTGAGCGGCCAGGCGGCAGCGGTCTGGGAACTGGGGGCCACAAGGACATACTCTACCAGTGCA 

AAAACTTCACACCGCCGCCCCGCAGGGCAGGCGAGGCACTGCCCGGGCCAGCTGGCCATGGGG 

CTGGTGGGGAGGACGAGGAGGATGAAGAGCTGAGCCGTGGAGATGGGGACTCCCCAGAGGCA 

GAGAAGTTCATCTCACACAAGTTCACCAAAGACCCCAGCTGCTCCCTCGGAAGGCCAGGCTGCT 

GGGCTCCAGGGCCCAAAGTGGACAACCGCGCCGTCAGAAGTACCAAGGACGTGCGCCGTGCTG 

GCAGGGAGTAGCCAGCCACCAGGCTGGCACCCAGAACCCTTGCTGGCACCACGCTGCCTGCCG 

GACCATAGGAGGCCAAGGCCGTGTGCATAGTITCTTTATTITGTGTAAAAAACAAAACCAAAAC 

CAAAAAACAAATGTTTATTTTTTACGTTTCTTTAACCT^ 

CGGAAAACAACGGAGTATTCTCGGATCATTGCTATTTTTGTAAAGTTTCCGCGTCCGCACGCAC 
TGTGGCAGGAGAGCAGGGCGTGTGTATGTGTGTGTGTGTGTGTCCTCACC 


Figure 8 

GAAGGCCATGGTCTCCCCACGGATGTCCGGGCTC 

CCCCAGACACGGCCCGCTGGCGTCTTCGAGCTGCAGATCCACTCTTTCGGGCCGGGTCCAGGCCCTGGGG 
CCCCGCGGTCCCCCTGCAGCGCCCGGCTCCCCTGCCGCCTCTTCTTCAGAGTCTGCCTGAAGCCTGGGCT 
CTCAGAGGAGGCCGCCGAGTCCCCGTGCGCCCTGGGCGCGGCGCTGAGTGCGCGCGGACCGGTCTACACC 

GAGCAGCCCGGAGCGCCCGCGCCTGATCTCCCACTGCCCGACGGGCTCTTGCAGGTGCCCTTCCGGGACG 

CCTGGCCTGGCACCITCTCTTTCATCATCGAAACCTGGAGAGAGGAGTTAGGAGACCAGATTGGAGGGCC 

CGCCTGGAGCCTGCTGGCGCGCGTGGCTGGCAGGCGGCGCTTGGCAGCCGGAGGCCCGTGGGCCCGGGC 

ATTCAGCGCGCAGGCGCCTGGGAGCTGCGCTTCTCGTACCGCGCGCGCTGCGAGCCGCCTGCCGTCGGGA 

CCGCGTGCACGCGCCTCTGCCGTCCGCGCAGCGCCCCCTCGCGGTGCGGTCCGGGACTGCGCCCCTGCGC 

ACCGCTCGAGGACGAATGTGAGGCGCCGCTGGTGTGCCGAGCAGGCTGCAGCCCTGAGCATGGCTTCTGT 

GAACAGCCCGGTGAATGCCGATGCCTAGAGGGCTGGACTGGACCCCTCTGCACGGTCCCTGTCTCCACCA 

GCAGCTGCCTCAGCCCCAGGGGCCCGTCCTCTGCTACCACCGGATGCCTTGTCCCTGGGCCTGGGCCCTG 

TGACGGGAACCCGTGTGCCAATGGAGGCAGCTGTAGTGAGACACCCAGGTCCTTTGAATGCACCTGCCCG 

CGTGGGTTCTACGGGCTGCGGTGTGAGGTGAGCGGGGTGACATGTGCAGATGGACCCTGCTTCAACGGCG 

GCrrGTGTGTCGGGGGTGCAGACCCTGACTCTGCCTACATCTGCCACTGCCCACCTGGTTTCCAAGGCTC 

CAACTGTGAGAAGAGGGTGGACCGGTGCAGCCTGCAGCCATGCCGCAATGGCGGACTCTGCCTGGACCG 

GGCCACGCCCTGCGCTGCCGCTGCCGCGCCGGCTTCGCGGGT<XTCGCTGCGAGCACGACCTGGACGACT 

GCGCGGGCCGCGCCTGCGCTAACGGCGGCACGTGTGTGGAGGGCGGCGGCGCGCACCGCTGCTCCTGCC 

GCTGGGC1TCGGCGGCCGCGACTGCCGCGAGCGCGCGGACCCGTGCGCCGCGCGCCCCTGTGCTCACGGC 

GGCCGCTGCTACGCCCACITCTCCGGCCTCGTCTGCGCTTGCGCTCCCGGCTACATGGGAGCGCGGTGTG 

AGTTCCCAGTGCACCCCGACGGCGCAAGCGCCTTGCCCGCGGCCCCGCCGGGCCTCAGGCCCGGGGACCC 

TCAGCGCTACCITITGCCTCCGGCTCTGGGACTGCTCGTGGCCGCGGGCGTGGCCGGCGCTGCGCTCrTG 

CTGGTCCACGTGCGCCGCCGTGGCCACTCCCAGGATGCTGGGTCTCGCTTGCTGGCTGGGACCCCGGAGC 

GGTCAGTCCACGCACTCCCGGATGCACTCAACAACCTAAGGACGCAGGAGGGTTCCGGGGATGGTCCGG 

CTCGTCCGTAGATTGGAATCGCCCTGAAGATGTAGACCCTCAAGGGATTTATGTCATATCTGCTCCTTCC 

ATCTACGCTCGGGAGGTAGCGACGCCCCTTITCCCCCCGCTACACACTGGGCGCGCTGGGCAGAGGCAGC 

ACCTGCTITITCCCTACCCTTCCTCGATTCTGTCCGTGAAATGAATTGGGTAGAGTCTCrGGAAGGTm 

AAGCCCATITrCAGTTCTAACTTACITTCATCCTATTTTGCATCCCTCTTATC 

ATCTTCTCTTT 

Figure 9 

AAACCGGAACGGGGCCCAACTrCTGGGGCCTGGAGAAGGGAAACGAAGTCCCCCCCGGTTTCCCGAGGT 

GCCTTTCCTCGGGCATCCITGGTTTCGGCGGGACTTCGCAGGGCGGATATAAAGAACGGCGCCTTTGGGA 

AGAGGCGGAGACCGGCITTAAAGAAAGAAGTCTTGGTCCTGCGGCTTGGGCGAGGCAAGGGCGAGGCAG 

GGCGCrTTCTGCCGACGCTCCCCGTGGCCCTACGATCCCCCGCGCGTCCGCCGCTGTTCTAAGGAGAGAA 

GTGGGGGCCCCCCAGGCTCGCGCGTGGAGCGAAGCAGCATGGGCAGTCGGTGCGCGCTGGCCCTGGCGT 

GCTCrCGGCCTTGCTGTGTCAGGTCTGGAGCTCTGGGGTGTTCGAACTGAAGCTGCAGGAGTTCGTCAAC 

AAGAAGGGGCTGCTGGGGAACCGCAACTGCTGCCGCGGGGGCGCGGGGCCACCGCCGTGCGCCTGCCGA 

CCTTCTTCCGCGTGTGCCTCAAGCACTACCAGGCCAGCGTGTCCCCCGAGCCGCCCTGCACCTACGGCAG 

CGCCGTCACCCCCGTGCTGGGCGTCGACrCCTTCAGTCTGCCCGACGGCGGGGGCGCCGACTCCGCGTTC 

AGCAACCCCATCCGCTTCCCCTTCGGCTTCACCTGGCCGGGCACCTTCrGTCTGATTATTGAAGCTCT 

ACACAGA1TCTCCTGATGACCTCGCAACAGAAAACCCAGAAAGACTCATCA.GCCGCCTGGCCACCCAGAG 

GCACCTGACGGTGGGCGAGGAGTGGTCCCAGGACCTGCACAGCAGCGGCCGCACGGACCTCAAGTACTC 

TACCGCTTCGTGTGTGACGAACACTACTACGGAGAGGGCTGCTCCGTTTTCTGCCGTCCCCGGGACGATG 

CCTTCGGCCACrTCACCTGTGGGGAGCGTGGGGAGAAAGTGTGCAACCCTGGCTGGAAAGGGCCCTACTG 

CACAGAGCCGATCTGCCTGCCrGGATGTGATGAGCAGCATGGATTTTGTGACAAACCAGGGGAATGCAAG 

TGCAGAGTGGGCTGGCAGGGCCGGTACTGTGACGAGTGTATCCGCTATCCAGGCTGTCTCCATGGCACCT 

GCCAGCAGCCCTGGCAGTGCAACTGCCAGGAAGGCTGGGGGGGCCirrTCTGCAACCAGGACCTGAACTA 

CTGCACACACCATAAGCCCTGCAAGAATGGAGCCACCTGCACCAACACGGGCCAGGGGAGCTACACTTC 

TCTTGCCGGCCTGGGTACACAGGTGCCACCTGCGAGCTGGGGATTGACGAGTGTGACCCCAGCCCTTGTA 

AGAACGGAGGGAGCTGCACGGATCTCGAGAACAGCTACTCCTGTACCTGCCCACCCGGCTTCTACGGCAA 

AATCTGTGAATTGAGTGCCATGACCTGTGCGGACGGCCCTTGCTTTAACGGGGGTCGGTGCTCAGACAGC 

CCCGATGGAGGGTACAGCTGCCGCTGCCCCGTGGGCTACTCCGGCTTCAACTGTGAGAAGAAAATTGACT 

ACTGCAGCTCTTCACCCTGTTCTAATGGTGCCAAGTGTGTGGACCTCGGTGATGCCTACCTGTGCCGCTG 

CCAGGCCGGCTTCTCGGGGAGGCACTGTGACGACAACGTGGACGACTGCGCCTCCTCCCCGTGCGCCAAC 

GGGGGCACCTGCCGGGATGGCGTGAACGACTTCTCCTGCACCTGCCCGCCTGGCTACACGGGCAGGAACT 

GCAGTGCCCCCGTCAGCAGGTGCGAGCACGCACCCTGCCACAATGGGGCCACCTGCCACCAGAGGGGCA 

CGGCTATGTGTGCGAATGTGCCCGAAGCTACGGGGGTCCCAACTGCCAGTTCCTGCTCCCCGAGCTGCCC 

CCGGGCCCAGCGGTGGTGGACCTCACTGAGAAGCTAGAGGGCCAGGGCGGGCCATTCCCCTGGGTGGCG 

TGTGCGCCGGGGTCATCCTTGTCCTCATGCTGCTGCTGGGCTGTGCCGCTGTGGTGGTCTGCGTCCGGCT 

GAGGCTGCAGAAGCACCGGCCCCCAGCCGACCCCTGCCGGGGGGAGACGGAGACCATGAACAACCTGGC 

AACTGCCAGCGTGAGAAGGACATCTCAGTCAGCATCATCGGGGCCACGCAGATCAAGAACACCAACAAA 

AGGCGGACTTCCACGGGGACCACAGCGCCGACAAGAATGGG1TCAAGGCCCGCTACCCAGCGGTGGACA 

TAACCTCGTGCAGGACCTCAAGGGTGACGACACCGCCGTCAGGGACGCGCACAGCAAGCGTGACACCAG 
TGCCAGCCCCAGGGCTCCTCAGGGGAGGAGAAGGGGACCCCGACCACACTCAGGGGTGGAGAAGCATCG 

AAAGAAAAAGGCCGGACTCGGGCTGTTCAACTTCAAAAGACACCAAGTACCAGTCGGTGTACGTCATATC 

CGAGGAGAAGGATGAGTGCGTCATAGCAACTGAGGTGTAAAATGGAAGTGAGATGGCAAGACTCCCGTT 

CTCTTAAAATAAGTAAAATTCCAAGGATATATGCCCCAACGAATGCTGCTGAAGAGGAGGGAGGCCTCGT 

GGACTGCTGCTGAGAAACCGAGTTCAGACCGAGCAGGTTCTCCTCCTGAGGTCCTCGACGCCTGCCGACA 

GCCTGTCGCGGCCCGGCCGCCTGCGGCACTGCCTTCCGTGACGTCGCCGTTGCACTATGGACAGTTGCTC 

TTAAGAGAATATATATTTAAATGGGTGAACTGAATTACGCCTAAGAAGCATGCACTGCCTGAGTGTATAT 

TTTGGATTCTTATGAGCCAGTCTTTTCTTGAATTAGA^ 

ACGAAGATGTGCTTTTTCTAGATGGAAAAGATGTGTGTTATTTTTTGGATIT 

ATATCTGTAAAGCTTGAGTATTTTGTGATGTTCGTTTTTTATAATTTAAATT^ 

GGCACTTCGGGTCTATGTGACTATATTTTTTTG^^ 

TTTGAGTTTTTTACTGTTTTGTTAATGAA 

AGGAATTC 



Figure 10 

ATGGCGGCAGCGTCCCGGAGCGCCTCTGGCTGGGCGCTACTGCTGCTGGTGGCACTTTGGCAGCAGCGCG 

CGGCCGGCTCCGGCGTCTTCCAGCTGCAGCTGCAGGAGTTCATCAACGAGCGCGGCGTACTGGCCAGTGG 

GCGGCCrTGCGAGCCCGGCTGCCGGACTTTCTTCCGCGTCTGCCTTAAGCACTTCCAGGCGGTCGTCTCG 

CCCGGACCCTGCACCITCGGGACCGTCTCCACGCCGGTATTGGGCACCAACTCCTTCGCTGTCCGGGACG 

ACAGTAGCGGCGGGGGGCGCAACCCTCTCCAACTGCCCTTCAATTTCACCTGGCCGGGTACCTTCTCGCT 

CATCATCGAAGCTTGGCACGCGCCAGGAGACGACCTGCGGCCAGAGGCCTTGCCACCAGATGCACTCATC 

AGCAAGATCGCCATCCAGGGCTCCCTAGCTGTGGGTCAGAACTGGTTATTGGATGAGCAAACCAGCACCC 

TCACAAGGCTGCGCTACTCTTACCGGGTCATCTGCAGTGACAACTACTATGGAGACAACTGCTCCCGCCT 

GTGCAAGAAGCGCAATGACCACTTCGGCCACTATGTGTGCCAGCCAGATGGCAACITGTCCTGCCTGCCC 

GGTTGGACTGGGGAATATTGCCAACAGCCTATCTGTCTTTCGGGCrGTCATGAACAGAATGGCTACrrGCA 

GCAAGCCAGCAGAGTGCCTCTGCCGCCCAGGCTGGCAGGGCCGGCTGTGTAACGAATGCATCCCCCACAA 

TGGCTGTCGCCACGGCACCTGCAGCACTCCCTGGCAATGTACTTGTGATGAGGGCTGGGGAGGCCTGTTT 

TGTGACCAAGATCTCAACTACTGCACCCACCACTCCCCATGCAAGAATGGGGCAACGTGCTCCAACAGTG 

GGCAGCGAAGCTACACCTGCACCTGTCGCCCAGGCTACACTGGTGTGGACTGTGAGCTGGAGCTCAGCGA 

GTGTGACAGCAACCCCTGTCGCAATGGAGGCAGCTGTAAGGACCAGGAGGATGGCTACCACTGCCTGTGT 

CCTCCGGGCTACTATGGCCTGCATTGTGAACACAGCACCTTGAGCTGCGCCGACTCCCCCTGCTTCAATG 

GGGGCTCCTGCCGGGAGCGCAACCAGGGGGCCAACTATGCTTGTGAATGTCCCCCCAACTTCACCGGCTC 

CAACTGCGAGAAGAAAGTGGACAGGTGCACCAGCAACCCCTGTGCCAACGGGGGACAGTGCCTGAACCA 

GGTCCAAGCCGCATGTGCCGCTGCCGTCCTGGATTCACGGGCACCTACTGTGAACTCCACGTCAGCGACT 

GTGCCCGTAACCCITGCGCCCACGGTGGCACTTGCCATGACCTGGAGAATGGGCTCATGTGCACCTGCCC 

TGCCGGCTTCTCTGGCCGACGCTGTGAGGTGCGGACATCCATCGATGCCTGTGCCTCGAGTCCCTGCTTC 

AAGAGGGCCACCTGCTACACCGACCTCTCCACAGACACCTTTGTGTGCAACTGCCCTTATGGCTTTGTGG 

GCAGCCGCTGCGAGTTCCCCGTGGGCTTGCCGCCCAGCTTCCCCTGGGTGGCCGTCTCGCTGGGTGTGGG 

GCTGGCAGTGCTGCTGGTACTGCTGGGCATGGTGGCAGTGGCTGTGCGGCAGCTGCGGCTTCGACGGCCG 

GACGACGGCAGCAGGGAAGCCATGAACAACTTGTCGGACTTCCAGAAGGACAACCTGATTCCTGCCGCC 

AGCTTAAAAACACAAACCAGAAGAAGGAGCTGGAAGTGGACTGTGGCCTGGACAAGTCCAACTGTGGCA 

ACAGCAAAACCACACATTGGACTATAATCTGGCCCCAGGGCCCCTGGGGCGGGGGACCATGCCAGGAAG 

TTTCCCCACAGTGACAAGAGCTTAGGAGAGAAGGCGCCACTGCGGTTACACAGTGAAAAGCCAGAGTGC 

GGATATCAGCGATATGCTCCCCCAGGGACTCCATGTACCAGTCTGTGTGTTTGATATCAGAGGAGAGGAA 

TGAATGTGTCATTGCCACGGAGGTATAA 



Figure 11 

CTCGCAGGCTAGGAACCCGAGGCCAAGAGCTGCAGCCAAAGTCACTTGGGTGCAGTGTACTCCCTCACTA 

GCCCGCTCGAGACCCTAGGATTTGCTCCAGGACACGTACTTAGAGCAGCCACCGCCCAGTCGCCCTCACC 

TGGATTACCTACCGAGGCATCGAGCAGCGGAGTTTTTGAGAAGGCGACAAGGGAGCAGCGTCCCGAGGG 

AATCAGCTTTTCAGGAACTCGGCTGGCAGACGGGACTTGCGGGAGAGCGACATCCCTAACAAGCAGATTC 

GGAGTCCCGGAGTGGAGAGGACACCCCAAGGGATGACGCCTGCGTCCCGGAGCGCCTGTCGCTGGGCGT 

ACTGCTGCTGGCGGTACTGTGGCCGCAGCAGCGCGCTGCGGGCTCCGGCATCTTCCAGCTGCGGCTGCAG 

GAGTTCGTCAACCAGCGCGGTATGCTGGCCAATGGGCAGTCCTGCGAACCGGGCTGCCGGACrrTCTTCC 

GCATITGCCTTAAGCACTTCCAGGCAACCTTCTCCGAGGGACCCTGCACCTTTGGCAATGTCTCCACGCC 

GGTATTGGGCACCAACTCCTTCGTCGTCAGGGACAAGAATAGCGGCAGTGGTCGCAACCCTCTGCAGTTG 

CCCITCAATTTCACCTGGCCGGGAACCTTCTCACTCAACATCCAAGCTTGGCACACACCGGGAGACGACC 

TGCGGCCAGAGACTTCGCCAGGAAACTCTCTCATCAGCCAAATCATCATCCAAGGCTCTCTTGCTGTGGG 
TAAGATTTGGCGAACAGACGAGCAAAATGACACCCTCACCAGACTGAGCTACTCTTACCGGGTCATCTGC 

AGTGACAACTACTATGGAGAGAGCTGTTCTCGCCTATGCAAGAAGCGCGATGACCACTTCGGACATTATG 

AGTGCCAGCCAGATGGCAGCCTGTCCTGCCTGCCGGGCTGGACTGGGAAGTACTGTGACCAGCCTATATG 

TCmCTGGCTGTCATGAGCAGAATGGTTACTGCAGCAAGCCAGATGAGTGCATCTGCCGTCCAGGTTGG 

CAGGGTCGCCTGTGCAATGAATGTATCCCCCACAATGGCTGTCGTCATGGCACCTGCAGCATCCCCTGGC 

AGTGTGCCTGCGATGAGGGATGGGGAGGTCrGTTTTGTGACCAAGATCTCAACTACTGTACTCACCACTC 

TCCGTGCAAGAATGGATCAACGTGTTCCAACAGTGGGCCAAAGGGTTATACCTGCACCTGTCTCCCAGGC 

TACACTGGTGAGCACTGTGAGCTGGGACTCAGCAAGTGTGCCAGCAACCCCTGTCGAAATGGTGGCAGCT 

GTAAGGACCAGGAGAATAGCTACCACTGCCTGTGTCCCCCAGGCTACTATGGCCAGCACTG^ 

TACCTrGACCTGTGCGGACTCACCCTGCTTCAATGGGGGCTCTTGCCGGGAGCGCAACCAGGGGTCCAGT 

TATGCCTGCGAATGCCCCCCCAACTTTACCGGCTCTAACTGTGAGAAGAAAGTAGACAGGTGTAC^ 

ACCCGTGTGCCAATGGAGGCCAGTGCCTGAACAGAGGTCCAAGCCGAACCTGCCGCTGCCGGCCTGGATT 

CACAGGCACCCACTGTGAACTGCACATCAGCGATTGTGCCCGAAGTCCCTGTGCCCACGGGGGCACTTGC 

CACGATCTGGAGAATGGGCCTGTGTGCACCTGCCCCGCTGGCTTCTCTGGCAGGCGCTGCGAGGTGCGGA 

TAACCCACGATGCCTGTGCCTCCGGACCCTGCTTCAATGGGGCCACCTGCTACACTGGCCTCTCCCCAAA 

CAACnTCGTCTGCAACTGTCCTTATGGCTTTGTGGGCAGCCGCTGCGAGTTTCCCGTGGGCTTGCCACCG 

AGCTTCCCCTGGGTAGCTGTCTCGCTGGGCGTGGGGCTAGTGGTACTGCTGGTGCTGCTGGTCATGGTGG 

TAGTGGCTGTGCGGCAGCTGCGGCTTCGGAGGCCCGATGACGAGAGCAGGGAAGGCATGAACAATCTGC 

AGACTTCCAGAAGGACAACCTAATCCCTGCCGCCCAGCTCAAAAACACAAACCAGAAGAAGGAGCTGGA 

GTGGACTGTGGTCTGGACAAGTCCAATTGTGGCAAACTGCAGAACCACACATTGGACTACAATCTAGCCC 

CGGGACTCCTAGGACGGGGCAGCATGCCTGGGAAGTATCCTCACAGTGACAAGAGCTTAGGAGAGAAGT 

GCCACTTCGGTTACACAGTGAGAAGCCAGAGTGTCGAATATCAGCCATTTGCTCTCCCAGGGACTCTATG 

TACCAATCAGTGTGTTTGATATCAGAAGAGAGGAACGAGTGTGTGATTGCCACAGAGGTATAAGGCAGA 

GCCTACTCAGACACCCAGCTCCGGCCCAGCAGCTGGGCCTTCCITCTGCATrGTTrACATTGCATCCTGT 

ATGGGACATCTTTAGTATGCACAGTGCTGCTCTGCGGAGGAGGAGGGAATGGCATGAACTGAACAGACG 

TGAACCCGCCAAGAGTTGCACCGGCTCTGCACACCTCCAGGAGTCTGCCTGGCTTCAGATGGGCAGCCCC 

GCCAAGGGAACAGAGTTGAGGAGTTAGAGGAGCATCAGTTGAGCTGATATCTAAGGTGCCTCTCGAACTT 

GGA(nTGCTCTGCCAACAGTGGTCATCATGGAGCTCTTGACTGTTCTCCAGAGAGTGGCAGTGGCCCTAG 

TGGGTCTTGGCGCTGCTGTAGCTCCTGTGGGCATCTGTATTTCCAAAGTGCX^rTTGCCCAGACTCCATCC 

TCACAGCTGGGCCCAAATGAGAAAGCAGAGAGGAGGCTTGCAAAGGATAGGCCTCCCGCAGGCAGAACG 

CCTrGGAGTTTGGCATTAAGCAGGAGCTACTCTGCAGGTGAGGAAAGCCCGAGGAGGGGACACGTGTGC 

TCCTGCCTCCAACCCCAGCAGGTGGGGTGCCACCTGCAGCCTCTAGGCAAGAGTTGGTCCTTCCCCTGGT 

CCTGGTGCCTCTGGGCTCATGTGAACAGATGGGCTTAGGGCACGCCCCTTTTGCCAGCCAGGGGTACAGG 

CCTCACTGGGGAGCTCAGGGCCTTCATGCTAAACTCCCAATAAGGGAGATGGGGGGAAGGGGGCTGTGC 

CTAGGCCCl^CCCTCCCrrCACACCCATTTTTGGGCCCITGAGCCrGGGCTCCAC^ 

CCCGAGACCAACCTTGAAGCCGATTlTCAAAAATCAATAATATGAGGTTTrGTrTTGTAGTTTATTT^ 

AATCTAGTATTTTGATAATTTAAGAATCAGAAGCACrGGCCTTTCTACATTTTATA^ 

ATAATGTGTAT1TATAATATGAAACAGATGTGTACATAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Figure 12 

AAACCCACTCCACCnTACTACCAGACAACCTTAGCCAAACGATTTACCCAAATAAAGTATAGGC 

GATAGAAATTGAAACCTGGCGCAATAGATATAGTACCGCAAGGGAAAGATGAAAAATTATAAC 

CAAGCATAATATAGCAAGGACTAACCCCTATACCTTCTGCATAATGAATTAACTAGAAATAACT 

TrGCAAGGAGAGTCAAAGCTAAGGCCCCCGAAACCAGGCGAGCTACCTAAGAACAGCTAAAA 

GAGCACACCCGTCTATGTAGCAAAATAGTGGGAAGATTTATAGGTAGAGGCGACAAACCTACC 

GAGCCrGGTGATAGCTGGTTGTCCAAGATAGAATCTTAGTTCAACTTTAAATTTGCCCACAGAA 

CCCTCTAAATCCCCTTGTAAATTTAACTGTTAGTCCAAAGAGGAACAGCTCTTTGGACACTAGG 

AAAAAACCTTGTAGAGAGAGTGTGAGCCCAATTCCACACIT1TCCACATGTTGGATGGCCTTGG 

AGTGGTAGCCATAAGCATTTTTGGAATTCAACTAAAAACTGAAGGATCCTTGAGGACGGCAGT 

ACCTGGCATACCTACACAGTCAGCGTTCAACAAGTGTTTGCAAAGGTACATTGGGGGACTGGG 

GGCACGAGTGATCTGTGACAATATCCCTGGTTTGGTGAGCCGGCAGCGGCAGCTGTGCCAGCGT 

TACCCAGACATCATGCGTTCAGTGGGCGAGGGTGCCCGAGAATGGATCCGAGAGTGTCAGCAC 

CAATTCCGCCACCACCGCrGGAACrGTACCACCCTGGACCGGGACCACACCGTCTTTGGCCGTG 

TCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTATATGCCATCTCATCAGCAGGGGTGATCCA 

CGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAGTGTGTGCAGCTGTGACCCCTACACCCGT 

GGCCGACACCATGACCAGCGTGGGACTTTTGACTGGGGTGGCTGCAGTGACAACATCCACTAC 

GGTGTCCGTTTTGCCAAGGCCTTCGTGGATGCCAAGGAGAAGAGGCTTAAGGATGCCCGGGCC 

CTCATGAACTTACATAATAACCGCTGTGGTCGCACGGCTGTGCGGCGGTTTGTCAAGCTGGAGT 
GTAAGTGCCATGGCGTGAGTGGTTCCTGTACTCTGCGCACCTGCrGGCGTGCACTCTCAGATrT

CCGCCGCACAGGTGATTACCTGCGGCGACGCTATGATGGGGCTGTGCAGGTGATGGCCACCCA 

AGATGGTGCCAACTTCACCGCAGCCCGCCAAGGCTATCGCCGTGCCACCCGGAGTGATCTTGTC 

TACTTTGACAACTCTCCAGATTACTGTGTCTTGGACAAGGCTGCAGGTTCCCTAGGCACTGCAG 

GCCGTGTCTGCAGCAAGACATCAAAAGGAACAGACGGTTGTGAAATCATGTGCTGTGGCCGAG 

GGTACGACACAACTCGAGTCACCCGTGTTACCCAGTGTGAGTGCAAATTCCACTGGTGCTGTGC 

TGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTCCATACTTGCAAAGCCCCCAAGAAGGC 

AGAGTGGCTGGACCAGACCTGAACACACAGATACCTCACTCATCCCTCCAATTCAAGCCTCTCA 

ACTCAAAAGCACAAGATCCTTGCATGCACACCTTCCTCCACCCTCCACCCTGGGCTGCTACCGC 

TTCTATITAAGGATGTAGAGAGTAATCCATAGGGACCATGGTGTCCTGGCTGGTTCCTTAGCCC 

TGGGAAGGAGTTGTCAGGGGATATAAGAAACTGTGCAAGCTCCCTGATTTCCCGCTCTGGAGAT 

TTGAAGGGAGAGTAGAAGAGATAGGGGGTCTTTAGAGTGAAATGAGTTGCACTAAAGTACGTA 

GTTGAGGCTCCTTTTTTCITrCCTTTC 

GGTACCTGTAGAGAGCTTCTITITGTITCTACCTGGCCAAAGTTAGATGGGACAAAGATGAATG 

GCATGTCCCTTCTCTGAAGTCCGTTTGAGCAGAACTACCTGGTACCCCGAAAGAAAAATCTTAG 

GCTACCACATTCTATTATTGAGAGCCTGAGATGTTAGCCATAGTGGACAAGGTTCCATTCACAT 

GCTCATATGTTTATAAACTGTGTTTTGTAGAAGAAAAAGAATCATAACAATACAAACACACATT 

CATTCTCTCTTTTTCTCTCTACCATTCTCAACCTGTATTGGACAGCACTGCCT 

GCTGCCTGTTCAAACTGAGGTGGAATGCAGTGGTTCCCATGCTTAACAGATCATTAAAACACCC 

TAGAACACTCCTAGGATAGATTAATGT 
Figure 13 



ACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGCACGGTTTCGTGGGGACCCAG 
GCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCITGAGTCCTTCTGAGATGATGGCTCT 
GGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGTAGCGGCGGCTCTCGGCGGCCACCC 
TCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAATTCCAACGCTATCAAGAACCTGCCC 

ccaccgctgggcggcgctgcggggcacccaggctctgcagtcaGcgccgcgccgggaatcctg 

TACCCGGGCGGGAATAAGTACCAGACCATTGACAACTACCAGCCGTACCCGTGCGCAGAGGAC 

GagGagtgcgGcactgatgagtactgcgctagtcccacccgcggaggggacgcaggcgtgcaa 
atctgtctcgcctgcaggaagcgccgaaaacgctgcatgcGtcacgctatgtgctgccccggga 
attactgcaaaaatggaatatgtgtgtcttctgatcaaaatcatttccgaggagaaattgagga 

AACCATCACTGAAAGCTTTGGTAATGATCATAGCACCTTGGATGGGTATTCCAGAAGAACCACC
TTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACT
GTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGA
AGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTTG 
TTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGATCACCATCAAGCCA GTAAT TCTTCT 
AGGCTTCACACTTGTCAGAGACACTAAACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAA 
TAGATGCTATGAAAACGTITTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTG 
GTTTCAGTTAAGCATTCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCrTTAT 
GGAACTCCCCTGTGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAA 
ATGCAATGAAACTTTTAATTATTTTTCTA^ 

AATITITGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCA 
TTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCT 
GGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAAT ACTAG CTTATTTTC 
TGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGlilllGAAATAAA 

ATTTAACATTTAATATCATGAAATGTTATAA 
Figure 14 

AGAAAGCGGGAGCCCGCGGCGAGCGTAGCGCAAGTCCGCTCCCTAGGCATCGCTGCGCTGGCA 
GCGATTCGCTGTCTCTTGTGAGTCAGGGGACAACGCTTCGGGGCAACTGTGAGTGCGCGTGTGG 
GGGACCTCGATTCTCTTCAGATCTCGAGGATTCGGTCCGGGGACGTCTCCTGATCCCCTACTAA 






WO 03/012082 



PCT/GB02/03409 




AGCGCCTGCTAACTTTGAAAAGGAGCACTGTGTCCTGCAAAGTTTGACACATAAAGGATAGGA 

AAAGAGAGGAGAGAAAAGCAACTGAGTTGAAGGAGAAGGAGCTGATGCGGGCCTCCTGATCA 

ATTAAGAGGAGAGTTAAACCGCCGAGATCCCGGCGGGACCAAGGAGGTGCGGGGCAAGAAGG

AACGGAAGCGGTGCGATCCACAGGGCTGGGTTTTCTTGCACCTTGGGTCACGCCTCCTTGGCGA 

GAAAGCGCCTCGCATTTGATTGCTTCCAGTTATTGCAGAACTTCCTGTCCTGGTGGAGAAGCGG 

GTCTCGCTTGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGGGTCCTGGGTC 

CCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACCCAAGTGAGGGGCCC 

CGTGTTGGGGTCCTCCCTCCCTTTGCATTCCCACCCCTCCGGGCTTTGeGTCTTCCTGGGGACCC 

CCTCGCCGGGAGATGGCCGCGTTGATGCGGAGCAAGGATTCGTCCTGCTGCCTGCTCCTACTGG 

CCGCGGTGCTGATGGTGGAGAGCTCACAGATCGGCAGTTCGCGGGCCAAACTCAACTCCATCA 

AGTCCTCTCTGGGCGGGGAGACGCCTGGTCAGGCCGCCAATCGATCTGCGGGCATGTACCAAG 

GACTGGCATTCGGCGGCAGTAAGAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGCAGTG

ATAAGGAGTGTGAAGTTGGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGCATGG 

TGTGTCGGAGAAAAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGCTGCA 

ATAATGGCATCTGTATCCCAGTTACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGGATGG 

TACTCGGCACAGAGATCGAAACCACGGTCATTACTCAAACCATGACTTGGGATGGCAGAATCT 

AGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGAGACCCCTGCCTACGATC 

ATCAGACTGCATTGAAGGGTTTTGCTGTGCTCGTCATTTCTGGACCAAAATCTGCAAACCAGTG 

CTCCATCAGGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCTCATGGGCTGGAAATTTTC 

CAGCGTTGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATGGAAAGATGCCACCTACTCCTCCA 

AAGCCAGACTCCATGTGTGTCAGAAAATTTGATCACCATTGAGGAACATCATCAATTGCAGACT 

GTGAAGTTGTGTATTTAATGCATTATAGCATGGTGGAAAATAAGGTTCAGATGCAGAAGAATG 

GCTAAAATAAGAAACGTGATAAGAATATAGATGATCACAAAAAGGGAGAAAGAAAACATGAA 

CTGAATAGATTAGAATGGGTGACAAATGCAGTGCAGCCAGTGTTTCCATTATGCAACTTGTCTA 

TGTAAATAATGTACACATTTGTGGAAAATGCTATTATTAAGAGAACAAGCACACAGTGGAAAT 

TACrGATGAGTAGCATGTGACTTTCCAAGAGTTTAGGTTGTGCTGGAGGAGAGGTTTCCTTCAG 

attgctgatrgcttatacaaataacctacatgccagatttctattcaacgttagagtttaacaa 

aatactcctagaataacttgttatacaataggttctaaaaataaaattgctaaacaagaaatga 

aaacatggagcattgttaatttacaacagaaaattaccttttc 

ttcaatcaagagtcitggtagataagaaaaaaatcagtcaatatttccaaataattgcaaaata 

atggccagttgtttaggaaggcctttaggaagagaaataaataagaaacaaacagccacaaat 

acttttttttcaaaattttagttttacctgtaa 

ccttcagattctacggaatgacagtatatctctctttatcctAtgtgattcctgctctgaatgca 

ttatattttccaaacratacccataaattgtgactagtaaaatacttacacagagcagaattit 

cacagatggcaaaaaaatttaaagatgtccaatatatgtgggaaaagagctaagagagagatc 

attatttcttaaagattggccataacctgtattttgatagaattagatrggtaaatacatgtatt 

catacatactctgtggtaatagagacttgagctggatctgtactgcactggagtaagcaagaa 

aattgggaaaactttttcgtttgttcaggttttggcaacacataga 

AGTTGGCTGTTCATCTTTGAAACCAGGGGATGCACAGTCTAAATGAATATCTGCATGGGATTTG 

ctatcataatatttactatgcagatgaattcagtgtgaggtcctgtgtccgtactatcctcaaat 

tatttattttatagtgctgagatcctcaaataatctcaatttcaggaggtttcaca 

cctgaagtagacagagtagtgaggtttcattgccctctataaggttctgactagccaatggcat 

catccaattttcttcccaaacctctgcagcatctgcittattgGcaaagggctagtt^ 

ctgcagccattgcggttaaaaaatataagtaggataacttgtaaaacctgcatattgctaatct 

ATAGACACCACAGTTTCTAAATTCTITGAAACCACTTTACTACTTTTTT^ 

CTAAATACTITGTCTGGAGCACAAAACAATAAAAGGTTATCTTATAGTCGTGACTTTAAACTTT 

TGTAGACCACAATTCACTTTITAGTTTTCTTTTACTTAAATCCCATCTGCAGTCT 

TCTCCCAGTAGAGATTGAGTTTGAGCCTGTATATCTATTAAAAATTTCAACTTCCCACATATATT 

TACTAAGATGATTAAGACTTACATTTTCTGCACAGGTCTGCAAAAACAAAAATTATAAACTAGT 

CCATCCAAGAACCAAAGTTTGTATAAACAGGTTGCTATAAGCTTGGTGAAATGAAAATGGAAC 

ATTTCAATCAAACATTTCGTATATAACAATTATTA 

TrATGTCCACCCTTTTAAAAATTATTATTTGAAGTAATTTATTTACA 

TATTTTCTTATAGAGATATTTCTTACAGAAAGCTTTGTAGCAGAATATATT^ 

TrGTAATTTAGGAAAAATGTATAATAAGATAAAATCTATTAAAril-TTCTCCTCTAAAAACTGA 

ATTCAAAGC 
Figure 15

ACACACAGGCGGCGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTG

CCTGCTGCTGGCGGCGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCA 

GTCAAGCCCGGCCCGGCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGC 

GAGGTTGAGGAACTGATGGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGA 

GGCAGAAGAAGCTGCTGCTAAAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTA 

TCACAATGAGACCAACACAGACACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAAT 

TCACAAGATAACCAACAACCAGACTGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTG 

GGAGACGAAGAAGGCAGAAGGAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCAT

GTACTGCCAGTTTGCCAGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGC 

ACCCGGGACAGTGAGTGCTGTGGAGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATGGCC 

ACCAGGGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGCTGT 

GCCITCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCTTTGCC 

ATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATGGAGCCTTGGA 

CCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGCCCCACAGCCACAGCCTGGTGTATGTGTGC 

AAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCCTGCTGCCCAGAGAGGTCCCC 

GATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCGCCAGGAGCTGGAGGACCTGGAGAGG 

AGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTGCCGCCGCTGCACTGCTGGGAGGG 

GAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAATAGAAATAGCTAATTTATTTCC 

CCAGGTGTGTGCITrAGGCGTGGGCTGACCAGGCITCTTCCTACATCITCTTCCCAGTAAGT^ 

CCCTCTGGCITGACAGCATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCTCCAGGCT 

TCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACCCCTGT 

CCAGATTATTGGCrGCTTTGCCTCTACCAGTTGGCAGACAGCCGTTTGTTCTACATGGCTTTGAT 

AATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTCTGATTGGTTTTGGGGAAATG 

TGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCrGGCAAAAATGCAACAAATGAATTTTCCA 

CGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGCAGATGAAATGTTCTGTTC 

ACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCCTACCTCTGTGCCAGGGCA 

GCATTITCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGGAGGGGGTCATTGTTCTCC 

TCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCCCAAGTCACACAGCTAGT 

GAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCTCTCCACTACCCCACAC 

CAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGATTTTTCTrTTGAGGCA 

TGCACATCTGGAATTAAGGTCAAACTAATTCrGACATCCCTCTAAAAGTAAACTACTGTTAGGA 

ACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGATATTGACACTGT 

CCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCATACAGGTTAA 

CCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGCAAAATCAC 

ttagcagcaactgaagacaattatcaaccacgtggagaaaatcaaaccgagcagggctgtgtg 

aaacatggttgtaatatgcgacrgcgaacactgaactctacgccactccacaaatgatgttttc 

aggtgtcatggactgttgccaccatgtattcatccagagttcitaaagtttaaagttgcacatg 

attgtataagcatgcittctttgagttttaaattatgtataaacataagt^ 

agcataaatcacttcaactgctcttct 



Figure 16 



GACAAACAGACGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTGGAGAGCAGC 

CTCGCnTGGTGACGCACAGTGCTGGGACCCTCCAGGAGCCCCGGGATTGAAGGATGGTGGCG 

GCCGTCCTGCTGGGGCTGAGCTGGCTCTGCTCTCCCCTGGGAGCTCTGGTCCTGGACTTCAACA 

ACATCAGGAGCTCTGCTGACCTGCATGGGGCCCGGAAGGGCTCACAGTGCCTGTCTGACACGG 

ACTGCAATACCAGAAAGTTCTGCCTCCAGCCCCGCGATGAGAAGCCGTTCTGTGCTACATGTCG 

TGGGTTGCGGAGGAGGTGCCAGCGAGATGCCATGTGCTGCCCTGGGACACTCTGTGTGAACGA 

TGTTTGTACTACGATGGAAGATGCAACCCCAATATTAGAAAGGCAGCTTGATGAGCAAGATGG 

CACACATGCAGAAGGAACAACTGGGCACCCAGTCCAGGAAAACCAACCCAAAAGGAAGCCAA 

GTATTAAGAAATCACAAGGCAGGAAGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACT 

GTGGCCCTGGACTTTGCrGTGCTCGTCATTTTTGGACGAAAAT^ 

GGACAGGTCTGCTCCAGAAGAGGGCATAAAGACACTGCTCAAGCTCCAGAAATCTTCCAGCGT 
TGCGACTGTGGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAATCGGCAGCATGCTCGAT
TAAGAGTATGCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAATCCACATTGC 

ATTTGAG 
Figure 17 

ATGGGGCTCTGGGCGCTGTTGCCTGGCTGGGTTTCTGCTACGCrGCTGCTGGCGCTGGCCGCTCT 

GCCCGCAGCCCTGGCTGCCAACAGCAGTGGCCGATGGTGGGGTATTGTGAACGTAGCCTCCTCC 

ACGAACCTGCTTACAGACTCCAAGAGTCTGCAACTGGTACTCGAGCCCAGTCTGCAGCTGTTGA 

GCCGCAAACAGCGGCGCCTGATACGCCAAAATCCGGGGATCCTGCACAGCGTGAGTGGGGGGC 

TGCAGAGTGCCGTGCGCGAGTGCAAGTGGCAGTTCCGGAATCGCCGCTGGAACTGTCCCACTG 

CTCCAGGGCCCCACCTCTTCGGCAAGATCGTCAACCGAGGCTGTCGAGAAACGGCGTTTATCTT 

CGCTATCACCTCCGCCGGGGTCACCCATTCGGTGGCGCGCTCCTGCTCAGAAGGTTCCATCGAA 

TCCTGCACGTGTGACTACCGGCGGCGCGGCCCCGGGGGCCCCGACTGGCACTGGGGGGGCTGC 

AGCGACAACATTGACTTCGGCCGCCTCTTCGGCCGGGAGTTCGTGGACTCCGGGGAGAAGGGG 

CGGGACCTGCGCTTCCTCATGAACCTTCACAACAACGAGGCAGGCCGTACGACCGTATTCTCCG 

AGATGCGCCAGGAGTGCAAGTGCCACGGGATGTCCGGCTCATGCACGGTGCGCACGTGCTGGA 

TGCGGCTGCCCACGCTGCGCGCCGTGGGCGATGTGCTGCGCGACCGCTTCGACGGCGCCTCGCG 

CGTCCTGTACGGCAACCGCGGCAGCAACCGCGCTTCGCGAGCGGAGCTGCTGCGCCTGGAGCC 

GGAAGACCCGGCCCACAAACCGCCCTCCCCCCACGACCTCGTCTACTTCGAGAAATCGCCCAAC 

TTCTGCACGTACAGCGGACGCCTGGGCACAGCAGGCACGGCAGGGCGCGCCTGTAACAGCTCG 

TCGCCCGCGCTGGACGGCTGCGAGCTGCTCTGCTGCGGCAGGGGCCACCGCACGCGCACGCAG 

CGCGTCACCGAGCGCTGCAACTGCACCTTCCACTGGTGCTGCCACGTCAGCTGCCGCAACTGCA 

CGCACACGCGCGTACTGCACGAGTGTCTGTGA 



Figure 18 

AGCAGAGCGGACGGGCGCGCGGGAGGCGCGCAGAGCTTTCGGGCTGCAGGCGCTCGCTGCCGC 

TGGGGAATTGGGCTGTGGGCGAGGCGGTCCGGGCTGGCCTTTATCGCTCGCTGGGCCCATCGTT 

TGAAACTTTATCAGCGAGTCGCCACTCGTCGCAGGACCGAGCGGGGGGCGGGGGCGCGGCGAG 

GCGGCGGCCGTGACGAGGCGCTCCCGGAGCTGAGCGCTTCTGCTCTGGGCACGCATGGCGCGC 

GCACACGGAGTCTGACCTGATGCAGACGCAAGGGGGTTAATATGAACGCCCCTCTCGGTGGAA 

TCTGGCTCTGGCTCCCTCTGCTCTTGACCTGGCTCACCCCCGAGGTCAACTCTTCATGGTGGTAC 

ATGAGAGCTACAGGTGGCTCCTCCAGGGTGATGTGCGATAATGTGCCAGGCCTGGTGAGCAGC 

CAGCGGCAGCTGTGTCACCGACATCCAGATGTGATGCGTGCCATTAGCCAGGGCGTGGCCGAG 

TGGACAGCAGAATGCCAGCACCAGTTCCGCCAGCACCGCTGGAATTGCAACACCCTGGACAGG 

GATCACAGCCTITITGGCAGGGTCCTACTCCGAAGTAGTCGGGAATCTGCCTTrGTTTATGCCAT 

CTCCTCAGCTGGAGTTGTATTTGCCATCACCAGGGCCTGTAGCCAAGGAGAAGTAAAATCCTGT 

TCCTGTGATCCAAAGAAGATGGGAAGCGCCAAGGACAGCAAAGGCATTTTTGATTGGGGTGGC 

TGCAGTGATAACATTGACTATGGGATCAAATTTGCCCGCGCATTTGTGGATGCAAAGGAAAGG 

AAAGGAAAGGATGCCAGAGCCCTGATGAATCTTCACAACAACAGAGCTGGCAGGAAGGCTGTA 

AAGCGGTTCTTGAAACAAGAGTGCAAGTGCCACGGGGTGAGCGGCTCATGTACTCTCAGGACA 

TGCTGGCTGGCCATGGCCGACTTCAGGAAAACGGGCGATTATCTCTGGAGGAAGTACAATGGG 

GCCATCCAGGTGGTCATGAACCAGGATGGCACAGGTTTCACTGTGGCTAACGAGAGGTTTAAG 

AAGCCAACGAAAAATGACCrCGTGTATTTTGAGAATTCTCCAGACTACTGTATCAGGGACCGAG 

AGGCAGGCTCCCTGGGTACAGCAGGCCGTGTGTGCAACCTGACTTCCCGGGGCATGGACAGCT 

GTGAAGTCATGTGCTGTGGGAGAGGCTACGAGACCTCCCATGTCACCCGGATGACCAAGTGTG 

GGTGTAAGTTCCACTGGTGCTGCGCCGTGCGCTGTCAGGACTGCCTGGAAGCTCTGGATGTGCA 

CACATGCAAGGCCCCCAAGAACGCTGACTGGACAACCGCTACATGACCCCAGCAGGCGTCACC 

ATCCACCTTCCCircrACAAGGACTCCATTGGATCrGCAAGAACACTGGACCITrGGGTTCrTTC 

TGGGGGGATATTTCCTAAGGCATGTGGCCITTATCrCAACGGAAGCCCCCrCTTCCTCCCTGGG 

GGCCCCAGGATGGGGGGCCACACGCTGCACCTAAAGCCTACCCTATTCTATCCATCTCCTGGTG 

TTCTGCAGTCATCrCCCCrCCrGGCGAGTTCTCTTTGGAAATAGCATGACAGGCTGTTCAGCCGG 

GAGGGTGGTGGGCCCAGACCACTGTCTCCACCCACCTTGACGTTTCTrCrTTCTAGAGCAGTTG 
GCCAAGCAGAAAAAAAAGTGTCTCA^

?aagaaatccca^ 

g^g^g^ttacctggagaagaatggctitcaatac^ 

TITrCAAGGCATITATTGCCATATTAAAATCTGATGTAACAAGGTGGGGACGTGTGTCCrTTGGT 
ACTATGGTGTGTTGTATCTITGTAAGAGCAAAAGCCTCAGAAAGGGATTGCTTTGCATTACTGT 
CCCCTTGATATAAAAAATCTTrAGGGAATGAGAGTTCCTTCTCACTTAGAATCTGAAGGGAATT 
AAAAAGAAGATGAATGGTCTGGGAATATTCTGTAACTATT^ 

TTAGTGGATGGAATATCAGAAGTATATCTGTACAGATCAAGAAAAAAAGGAAGAATAAAATTC 
CTATATCAT 



Figure 19 

CGGGAGTCTTCGGGGAGCTATGCTGAGACCGGGTGGTGCGGAGGAAGCTGCGCAGCTCCCGCT 

TCGGCGCGCCAGCGCCCCGGTCCCTGTGCCGTCGCCCGCGGCCCCCGACGGCTCCCGGGCTTCG 

GCCCGCCTAGGTCTrGCCTGCCTTCTGCTCCrGCTGCTGCTGACGCTGCCGGCCCGCGTAGACAC 

GTCCTGGTGGTACATTGGGGCACTGGGGGCACGAGTGATCTGTGACAATATCCCTGGTTTGGTG 

AGCCGGCAGCGGCAGCTGTGCCAGCGTTACCCAGACATCATGCGTTCAGTGGGCGAGGGTGCC 

CGAGAATGGATCCGAGAGTGTCAGCACCAATTCCGCCACCACCGCTGGAACTGTACCACCCTG 

GACCGGGACCACACCGTCTTTGGCCGTGTCATGCTCAGAAGTAGCCGAGAGGCAGCTTTTGTAT 

ATGCCATCTCATCAGCAGGGGTAGTCCACGCTATTACTCGCGCCTGTAGCCAGGGTGAACTGAG 

TGTGTGCAGCrGTGACCCCTACACCCGTGGCCGACACCATGACCAGCGTGGGGACTTTGACTGG 

GGTGGCTGCAGTGACAACATCCACTACGGTGTCCGTITrGCCAAGGCCTTCGTGGATGCCAAGG 

AGAAGAGGCTTAAGGATGCCCGGGCCCTCATGAACTTACATAATAACCGCTGTGGTCGCACGG 

CTGTGGGGCGGTTTCTGAAGCTGGAGTGTAAGTGCCATGGCGTGAGTGGTTCCTGTACTCTGCG 

CACCTGCTGGCGTGCACTCTCAGATTTCCGCCGCACAGGTGATTACCTGCGGCGACGCTATGAT 

GGGGCTGTGCAGGTGATGGCCACCCAAGATGGTGCCAACTTCACCGCAGCCCGCCAAGGCTAT 

CGCCGTGCCACCCGGACTGATCTTGTCTACirrGACAACTCTCCAGATTACTGTGTCTTGGACAA 

GGCTGCAGGTTCCCTAGGCACTGCAGGGCGTGTCTGCAGCAAGACATCAAAAGGAACAGACGG 

TTGTGAAATCATGTGCTGTGGCCGAGGGTACGACACAACTCGAGTCACCCGTGTTACCCAGTGT 

GAGTGCAAATTCCACTGGTGCTGTGCTGTACGGTGCAAGGAATGCAGAAATACTGTGGACGTC 

CATACrrGCAAAGCCCCCAAGAAGGCAGAGTGGCTGGACCAGACCTGAACACACAGATACCTC 

ACTCATCCCTCCAATTCAAGCCTCrCAACTCAAAAGCACAAGATCCTTGCATGCACACCTTCCT

CGACCCrCCACCCTGGGGTGCrACCGCTTCTATTTAAGGATGTAGAGAGTAATCCATAGGGACC 

ATGGTGTCCTGGCTGGTTCCTTAGCCCTGGGAAGGAGTTGTCAGGGGATATAAGAAACTGTGCA 

AGCTCCCTGATTTCCCGCTCTGGAGATTTGAAGGGAGAGTAGAAGAGATAGGGGGTCTTTAGA 

GTGAAATGAGTTGCACTAAAGTACGTAGTTGAGGCTCGllllllClllCCTTrGCACCAGCTTCC 

CGACACTTCTTGGTGTGCAAGAGGAAGGGTACCTGTAGAGAGCTTCTTTTTGTTTCTACCTGGC 

CAAAGTTAGATGGGACAAAGATGAATGGCATGTCCCTTCTCTGAAGTCCGTTTGAGCAGAACTA 

CCTGGTACCCCGAAAGAAAAATCTTAGGCTACCACATTCTATTATTGAGAGCCTGAGATGTTAG 

CCATAGTGGACAAGGTTCCATTCACATGCTCATATGTTTATAAACTGTGTTTTGTAGAAGAAAA 

AGAATCATAACAATACAAACACACATTCATTCTCTCTTTTTCTCTCTACCATTCTCAA 

TGGACAGCACTGCCTCTTTTGCTTACTTGCTGCCTGTTCAAACTGAGGTGGAATGGAGTGGTTCC 

CATGCTTAACAGATCATTAAAACACCCTAGAACACTCCTAGGATAGATTAATGT 



Figure 20 

GCGCTTCTGACAAGCCCGAAAGTCATTTCCAATCTCAAGTGGACTTTGTTCCAACTATTGGGGG 

CGTCGCTCCCCCTCYTCATGGTCGCGGGCAAACTTCCTCCTCGGCGCCTCTTCTAATGGAGCCCC 

ACCTGCTCGGGGTGCTCCTCGGCCTCCTGCTCGGTGGCACCAGGGTCCTCGCTGGCTACCCAAT 

TTGGTGGTCCCTGGCCCTGGGCCAGCAGTACACATCTCTGGGCTCACAGCCCCTGCTCTGCGGC 

TCCATCCCAGGCCTGGTCGCCAAGCAACTGCGCTTCTGCCGCAATTACATCGAGATCATGCCCG 
CGTGGCCGAGGGCGTGAAGCTGGGCATCCAGGAGTGCCAGCACCAGTTCCGGGGCCGCCGCT

GGAACTGCACCACCATAGATGACAGCCTGGCCATCTTTGGGCCCGTCCTCGACAAAGCCACCCG 

CGAGTCGGCCTTCGTTCACGCCATCGCCTCGGCCGGCGTGGCCTTCGCCGTCACCCGCTCCTGC 

GCCGAGGGCACCTCCACCATTTGCGGCTGTGACTCGCATCATAAGGGGCCGCCTGGCGAAGGC 

TGGAAGTGGGGCGGCTGCAGCGAGGACGCTGACTTCGGCGTGTTAGTGTCCAGGGAGTTCGCG 

GATGCGCGCGAGAACAGGCCGGACGCGCGCTCGGCCATGAACAAGCACAACAACGAGGCGGG 

CCGCACGACTATCCTGGACCACATGCACCTCAAATGCAAGTGCCACGGGCTGTCGGGCAGCTGT 

GAGGTGAAGACCTGCTGGTGGGCGCAGCCTGACTTCCGTGCCATCGGTGACTTCCTCAAGGACA 

AGTATGACAGCGCCTCGGAGATGGTAGTAGAGAAGCACCGTGAGTCCCGAGGCTGGGTGGAGA 

CCCTCCGGGCCAAGTACTCGCTCTTCAAGCCACCCACGGAGAGGGACCTGGTCTACTACGAGA 

ACTCCCCCAACTTTTGTGAGCCCAACCCAGAGACGGGTTCCTTTGGCACAAGGGACCGGACTTG 

CAATGTCACCTCCCACGGCATCGATGGCTGCGATCTGCTCTGCTGTGGCCGGGGCCACAACACG 

AGGACGGAGAAGCGGAAGGAAAAATGCCACTGCATCTTCCACTGGTGCTGCTACGTCAGCTGC 

CAGGAGTGTATTCGCATCTACGACGTGCACACCTGCAAGTAGGGCACCAG 



Figure 21 

ATGAGTCCCCGCTCGTGCCTGCGTTCGCTGCGCCTCCTCGTCTTCGCCGTCTTCTCAGCCGCCGC 
GAGCAACTGGCTGTACCTGGCCAAGCTGTCGTCGGTGGGGAGCATCTCAGAGGAGGAGACGTG 
CGAGAAACTCAAGGGCCTGATCCAGAGGCAGGTGCAGATGTGCAAGCGGAACCTGGAAGTCAT 
GGACTCGGTGCGCCGCGGTGCCCAGCTGGCCATTGAGGAGTGCCAGTACCAGTTCCGGAACCG 
GCGCTGGAACTGCTCCACACTCGACTCCTTGCCCGTCTTCGGCAAGGTGGTGACGCAAGGGATT 
CGGGAGGCGGCCTTGGTGTACGCCATCTCTTCGGCAGGTGTGGCCTTTGCAGTGACGCGGGCGT 
GCAGCAGTGGGGAGCTGGAGAAGTGCGGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAG 
GGCrrCCAGTGGTCAGGATGCrcrGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTTTG 
TGGATGTGCGGGAGAGAAGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCACAACA 
ATGAGGCCGGCAGGAAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGT 
CAGGCTCCTGTGAGGTAAAGACGTGCTGGCGAGCCGTGCCGCCCTTCCGCCAGGTGGGTCACG 
CAGTGAAGGAGAAGTTTGATGGTGCCAGTGAGGTGGAGCCACGCCGCGTGGGCTCCTCCAGGG
CACTGGTGCCACGCAACGCACAGTTCAAGCCGCACACAGATGAGGACTTGGTGTACTTGGAGC 
CTAGCCCCGACTTCTGTGAGCAGGACATGCGCAGCGGCGTGCTGGGCACGAGGGGCCGCACAT 
GCAACAAGACGTCCAAGGCCATCGACGGCTGTGAGCTGCTGTGCTGTGGCCGCGGCTTCCACA 
CGGCGCAGGTGGAGCTGGCTGAACGCTGCAGCTGCAAATTCCACTGGTGCTGCTTCGTCAAGTG 
CCGGCAGTGCCAGCGGCTCGTGGAGTTGCACACGTGCCGATGA 



Figure 22 

ATTAATTCTGGCTCCACTTGTTGCTCGGCCCAGGTTGGGGAGAGGACGGAGGGTGGCCGCAGC 

GGGTTCCTGAGTGAATTACCCAGGAGGGACTGAGCACAGCACCAACTAGAGAGGGGTCAGGGG 

GTGCGGGACTCGAGCGAGCAGGAAGGAGGCAGCGCCTGGCACCAGGGCTTTGACTCAACAGA 

ATTGAGACACGTTTGTAATCGCTGGCGTGCCCCGCGCACAGGATCCCAGCGAAAATCAGATTTC 

CTGGTGAGGTTGCGTGGGTGGATTAATTTGGAAAAAGAAACTGCCTATATCTTGCCATCAAAAA 

ACTCACGGAGGAGAAGCGCAGTCAATCAACAGTAAACTTAAGAGACCCCCGATGCTCCCCTGG 

TTTAACiTGTATGCrTGAAAATTATCTGAGAGGGAATAAACATCTTTTCCrTCITCCCTCTCCAG 

AAGTCCATTGGAATATTAAGCCCAGGAGTTGCTTTGGGGATGGCTGGAAGTGCAATGTCTTCCA 

AGTTCITCCTAGTGGCTrrGGCCATATTTTTCTCCITCGCCCAGGTTGTAATTGAAGCCAATTC^ 

GGTGGTCGCTAGGTATGAATAACCCTGTTCAGATGTCAGAAGTATATATTATAGGAGCACAGCC 

TCTCrGCAGCCAACTGGGAGGACTTTCTCAAGGACAGAAGAAACTGTGCCACTTGTATCAGGAC 

CACATGCAGTACATCGGAGAAGGCGCGAAGACAGGCATCAAAGAATGCCAGTATCAATTGCGA 

CATCGACGGTGGAACTGCAGCACTGTGGATAACACCTCrGTTTTTGGCAGGGTGATGCAGATAG 

GCAGCCGCGAGACGGCCTTCACATACGCCGTGAGCGCAGCAGGGGTGGTGAACGCCATGAGCC 

GGGCGTGCCGCGAGGGCGAGCTGTCCACCTGCGGCTGCAGCCGCGCCGCGCGCCCCAAGGACC 
TGCCGCGGGACTGGCTCTGGGGCGGCTGCGGCGACAACATCGACTATGGCTACCGCTTTGCCAA

GGAGTTCGTGGACGCCCGCGAGCGGGAGCGCATCCACGCCAAGGGCTCCTACGAGAGTGCTCG 

CATCCTCATGAACCTGCACAACAACGAGGCCGGCCGCAGGACGGTGTACAACCTGGCTGATGT 

GGCCTGCAAGTGCCATGGGGTGTCCGGCTCATGTAGCCTGAAGACATGCTGGCTGCAGCTGGC 

AGACTTCCGCAAGGTGGGTGATGCCCTGAAGGAGAAGTACGACAGCGCGGCGGCCATGCGGrCT 

CAACAGCCGGGGCAAGTTGGTACAGGTCAACAGCCGCTTCAACTCGCCCACCACACAAGACCT 

GGTCTACATCGAGCCCAGCCCTGACTACTGCGTGCGCAATGAGAGCACCGGCTCGCTGGGCAC 

GCAGGGCCGCCTGTGCAACAAGACGTCGGAGGGCATGGATGGCTGCGAGCTCATGTGCTGCGG 

CCGTGGGTACGACCAGTTCAAGACCGTGCAGACGGAGCGCTGCCACTGCAAGTTCCACTGGTG 

CTGCTACGTCAAGTGCAAGAAGTGCACGGAGATCGTGGACCAGTTTGTGTGCAAGTAGTGGGT 

GCCACCCAGCACTCAGCCCCGCTCCCAGGACCCGCTTATTTATAGAAAGTACAGTGATTCTGGT 

TITrGGTTTTTAGAAATATTTTTTAT 

TTACCATCTAAGAACTCTGTGGTTTATTATTAATATTATAATTATTATTTGGCAATAATGGGGGT 

GGGAACCACGAAAAATATTTATTTTGTGGATCITrGAAAAGGTAATACAAGACn^ 

AGTATAGAATGAAGGGGGAAATAACACATACCCTAACTTAGCTGTGTGGGACATGGTACACAT 

CCAGAAGGTAAAGAAATACATTTTCTTTTTCTCAAATATGCCATCATATGGGATGGGTAGGTTC 

CAGTTGAAAGAGGGTGGTAGAAATCTATTCACAATTCAGCITCrATGACCAAAATGAGTTGTAA 

ATTCTCrGGTGCAAGATAAAAGGTCTTGGGAAAACAAAACAAAACAAAACAAACCrCCCTTCC 

CCAGCAGGGCTGCTAGCITGCTTTCrGCATTTTCAAAATGATAATTTACAATGGAAGGACAAGA 

ATGTCATATrCTCAAGGAAAAAAGGTATATCACATGTCTCATTCTCCTCAAATATTCCATTTGCA 

GACAGACCGTCATATTCTAATAGCTCATGAAATTTGGGCAGCAGGGAGGAAAGTCCCCAGAAA 

TTAAAAAATTTAAAACTCTTATGTCAAGATGTTGATTTGAAGCrGTTAT 

AGATTTGTAAAAAGACCCCCAATGATTCTGGACACTAGAT^ 

AACATAAATGAAATATCCTGTATTTTCTTAGGGATACTTGGTTAGTAAATTATAATAGTAGAAA 

TAATACATGAATCCCATTCACAGGTTTCTCAGCCCAAGCAACAAGGTAATTGCGTGCCATTCAG 

CACTGCACCAGAGCAGACAACCTATTTGAGGAAAAACAGTGAAATCCACCTTCCTCTTCACACT 

GAGCCCTCrCTGATTCCTCCGTGTTGTGATGTGATGCTGGCCACGTTTCCAAACGGCAGCTCCAC 

TGGGTCCCCTTTGGTTGTAGGACAGGAAATGAAACATTAGGAGCTCTGCTTGGAAAACAGTTCA 

CTACITAGGGATTTTTGTTTCCrAAAACTTTTATTT^ 

ACAGAACTTGGCTAATGGAATTCACAGAGGTGTTGCAGCGTATCACTGTTATGATCCTGTGTTT 
AGATTATCCACrCATGCTTCrCCTATTGTACTGCAGGTGTACCrrAAAACTGTTCCCAGTGTACT 
TGAACAGTTGCATTTATAAGGGGGGAAATGTGGTTTAATGGTGCCTGATATCTCAAAGTCil 11 
GTACATAACATATATATATATATACATATATATAAATATAAATATAAATATATCTCATTGCAGC 
CAGTGATTTAGATTTACAGCrrACTCTGGGGTTATCTCrcrGTCTAGAGCATTGT^ 

tgcagtccagttgggattattccaaaagttttttgagtcitgAgcttgggctgtggccccg 
gatcataccctgagcacgacgaagcaacctcgtttctgaggaagaagcttgagttctgactcac 

TGAAATGCGTGTTGGGTTGAAGATATCTTTTTTT^ 

CATITCrGTTCACITTGTGGAGAGGGCATTACTTGTTCGTTATAGACATGGACGTTAAGAGATAT 
TCAAAACTCAGAAGCATCAGCAATGTTTCTCTITrCITAGTTCATTCTGCAGAATGGAAACCCAT 

gcctattagaaatgacagtacitAttaattgagtccctaaggaatattcagcccactacataga 

TAGCTTTITrTTI 11 ITllU'lll l llTAATAAGGACACCTCrTTCCAAACAGGCCATCAAATATGT 

TCTTATCrCAGACTTACGTTGTTTTAAAAGTTTGGAAAGATACACATCITITCATACCC 

AGGAGGTTGGGCTTTCATATCACCTCAGCCAACTGTGGCTCTTAATTTATTGCATAATGATATCC 

ACATCAGCCAACrGTGGCTCITTAATlTATTGCATAATGATATTCACATCCCCTCA GTTGCA GTG 

AATTGTGAGCAAAAGATCTTGAAAGCAAAAAGCACTAATTAGTTTAAAATGTCAClirriiGGT 

TTTTATTATACAAAAACCATGAAGTACITITITrATTTGCT 

CTCATGTTTATGAAGAGAGTTGAGTTTAACAATCCrAGCTTTTAAAAGAAACTATT^ 
AATATTCrACATGTCATTCAGATATTATGTATATCTTCrAGCCrTTATTCTGTACITTTAATG 
ATATTTCTGTCITGCGTGATrTGTATATTTCAGTGGTTTAAAAAACAAACATCGAA^ 
CCAAATGGAAG 

Figure 23 

GGCACGAGCGCAGGAGACACAGGCGCTGGCTGCCCCGTCCGCTCTCCGCCTCCGCCGCGCCCTCCTCGCC 
CGGG ATGGGCCCCCCCGCCGCCGCCGGATCCCTCGCCTCCCGGCCGCCGCCGTTGCGCTCGCCGCGCTCG 
CACTGAAGCCCGGGCCCTCGCGCGCCGCGGTTCGCCCCGCAGCCTCGCCCCCTGCCCACCCGGGCGGCCG 
TAGGGCGGTCACG ATGCTGCCGCCCTTACGCTCCCGCCTCGGGCTGCTGCTGCTGCTGCTCCTGTGCCCG

GCGCACGTCGGCGGACTGTGGTGGGCTGTGGGCAGCCCCTTGGTTATGGACCCTACCAGCATC^GCAGGA 

AGGCACGGCGGCTGGCCGGGCGGCAGGCCGAGTTGTGCCAGGCTGAGCCGGAAGTGGTGGCAGAGCTAGC 

TCGGGGCGCCCGGCTCGGGGTGCGAGAGTGCCAGTTCCAGTTCCGCTTCCGCCGCTGGAATTGCTCCAGC

CACAGCAAGGCCTTTGGACGCATCCTGCAACAGGACATTCGGGAGACGGCCTTCGTGTTGGCCATCACTG 

CGGCCGGCGCCAGCCACGCCGTCACGCAGGCCTGTTCTATGGGCGAGCTGCTGCAGTGCGGCTGCCAGGC 

GCCCCGCGGGCGGGCCCCTCCCCGGCCCTCCGGCCTGCCCGGCACCCCCGGACCCCCTGGCCGCGCGGGC 

TCCCCGGAAGGCAGCGCCGCCTGGGAGTGGGGAGGCTGCGGCGACGACGTGGACTTCGGGGACGAGAAGT 

CGAGGCTCTTTATGGACGCGCGGCACAAGCGGGGACGCGGAGACATCCGCGCGTTGGTGCAACTGCACAA 

CAACGAGGCGGGCAGGCTGGCCGTGCGGAGCCACACGCGCACCGAGTGCAAATGCCACGGGCTGTCGGGA 

TCATGCGCGCTGCGCACCTGCTGGCAGAAGCTGCCTCCATTTCGCGAGGTGGGCGCGCGGCTGCTGGAGC 

GCTTCCACGGCGCCTGACGCGTCATGGGGACCAACGACGGCAAGGCCCTGCTGCCCGCCGTCCGCACGCT 

CAAGCCGCCGGGCCGAGCGGACCTCCTGTACGCCGCCGATTCGCCCGACTTTTGCGCCCCCAACCGACGC 

ACCGGCTCCCCCGGCACGCGCGGTCGCGCCTGCAATAGCAGCGCCCCGGACCTCAGCGGCTGCGACCTGC 

TGTGCTGCGGCCGCGGGCACCGCCAGGAGAGCGTGCAGCTCGAAGAGAACTGCCTGTGCCGCTTCCACTG 

GTGCTGCGTAGTACAGTGCCACCGTTGCCGTGTGCGCAAGGAGCTCAGCCTCTGCCTGTGACCCGCCGCC 

CGGCCGCTAGACTGACTTCGCGCAGCGGTGGCTCGCACCTGTGGGACCTCAGGGCACCGGCACCGGGCGC 

CTCTCGCCGCTCGAGCCCAGCCTCTCCCTGCCAAAGCCCAACTCCCAGGGCTCTGGAAATGGTGAGGCGA 

GGGGCTTGAGAGGAACGCCCACCCAGGAAGGCCCAGGGCGCCAGACGGCCCCGAAAAGGCGCTCGGGGAG 

CGTTTAAAGGACACTGTACAGGCCCTCCCTCCCCTTGGCCTCTAGGAGGAAACAGTTTTTTAGACTGGAA 

AAAAGCCAGTCTAAAGGCCTCTGGATACTGGGCTCCCCAGAACTGCTGGCCACAGGATGGTGGGTGAGGT 

TAGTATGAATAAAGATATTTAAACCAAAAAAAAAAAAAAAAAAAAA 



Figure 24 

CACGCGTCCGGGCCAATCGGGACTATGAACCGGAAAGCGCTGCGCTGCCTGGGCCACCTCITTC 

TCAGCCTGGGCATGGTCTGCCTCCGGATCGGTGGCTTCTCCTCAGTGGTAGCTCTGGGCGCAAC 

GATCATCTGTAACAAGATCCCAGGCCTGGCTCCCAGACAC3CGGGCGATCTGCCAGAGCCGGCC 

CGACGCCATCATCGTCATAGGAGAAGGCTCACAAATGGGCCTGGACGAGTGTCAGTTTCAGTTC 

CGCAATGGCCGCTGGAACTGCTeTGCACTGGGAGAGCGCACCGTCTTCGGGAAGGAGCTCAAA 

GTGGGGAGCCGGGACGGTGCGTTCACCTACGCCATCATTGCCGCCGGCGTGGCCCACGCCATC 

acagctgcctgtacccatggcaacctgXgcgactgtggctgcgacaaagagaagcaaggccag 

TACCACCGGGACGAGGGCTGGAAGTGGGGtGGCTGCTCTGCCGACATCCGCTACGGCATCGGC 

TTCGCCAAGGTCTITGTGGATGCCCGGGAGATCAAGCAGAATGCCCGGACTCTCATGAACTTGC 

ACAACAACGAGGCAGGCCGAAAGATCCTGGAGGAGAACATGAAGCTGGAATGTAAGTGCCAC 

GGCGTGTCAGGCTCGTGCACCACCAAGACGTGCTGGACCACACTGCCACAGTTTCGGGAGCTG 

GGCTACGTGCTCAAGGACAAGTACAACGAGGCCGTTCACGTGGAGCCTGTGCGTGGCAGCCGC 

AACAAGCGGCCCACCTTCGTGAAGATCAAGAAGCCACTGTCGTACCGCAAGCCCATGGACACG 

GACCTGGTGTACATCGAGAAGTCGCCCAACTACTGCGAGGAGGACCCGGTGACCGGCAGTGTG 

GGCACCCAGGGCCGCGCCTGCAAGAAGACGGCTCCCCAGGCCAGCGGCTGTGACCTCATGTGC 

TGTGGGCGTGGCTACAACAGCCACCAGTACGCCCGCGTGTGGCAGTGCAACTGTAAGTTCCACT 

GGTGCTGCTATGTCAAGTGCAACACGTGCAGCGAGCGCACGGAGATGTACACGTGCAAGTGAG 

CCCCGTGTGCACACCACCCrCCCGCTGCAAGTCAGATTGCTGGGAGGACTGGACCGTTTCCAAG 

CTGCGGGCTCCCTGGCAGGATGCTGAGCTTGTCITTTCTGCTGAGGAAGGTACTTTTCCrGGGTT 

TCCTGCAGGCATCCGTGGGGGAAAAAAAATCTCTCAGAACCCTCAACTATrCTGTTCCACACCC 

AATGCTGCTCCACCCTCCCCCAGACACAGCCCAAGTCCCTCCGCGGCTGGAGCGAAGCCTTCTG 

CAGCAGGAACTCTGGACCCCTGGGCCTCATCACAGCAATATTTAACAATITATTCrGATAAAAA 

TAATATTAATTTATTTAATTAAAAAGAATTCTTCCACCTCAAAAAAAAAAAA 

AAAAGGGGGG 
Figure 25

TCCGCTTACACACCAAGGAAAGTTGGGCTITGAAGAATTCCATCCCCATGGCCACTGGAGGAA

GAATATTTCNCCCGTCTTGCITACCCATCTCCCCAGTTTTTTGGAATTT^ 

AGAGGATTATGTTTCTTrCAAAGCCTTCTGTGTACATCTGTCITITCACCrGTGTCCTCCAA 

AGCCACAGCTGGTCGGTGAACAATTTCCTGATGACTGGTCCAAAGGCnTACCTGATTTACTCCA 

GCAGTGTGGCAGCTGGTGCCCAGAGTGGTATTGAAGAATGCAAGTATCAGTTTGCCTGGGACC 

GCTGGAACTGCCCTGAGAGAGCCCTGCAGCTGTCCAGCCATGGTGGGCTTCGCAGTGCCAATCG 

GGAGACAGCATTTGTGCATGCCATCAGTTCTGCTGGAGTCATGTACACCCTGACTAGAAACTGC 

AGCCTTGGAGATTTTGATAACTGTGGCTGTGATGACTCCCGCAACGGGCAACTGGGGGGACAA 

GGCTGGCrGTGGGGAGGCTGCAGTGACAATGTGGGCTTCGGAGAGGCGATTTCCAAGCAGTTT 

GTCGATGCCCTGGAAACAGGACAGGATGCACGGGCAGCCATGAACCTGCACAACAACGAGGCT 

GGCCGCAAGGCGGTGAAGGGCACCATGAAACGCACGTGTAAGTGCCATGGCGTGTCTGGCAGC 

TGCACCACGCAGACCTGTTGGCTGCAGCTGCCCGAGTTCCGCGAGGTGGGCGCGCACCTGAAG 

GAGAAGTACCACGCAGCACTCAAGGTGGACCTGCTGCAGGGTGCTGGCAACAGCGCGGCCGCC 

CGCGGCGCCATCGCCGACACCTTTCGCTCCATCTCTACCCGGGAGCTGGTGCACCTGGAGGACT 

CCCCGGACTACTGCCTGGAGAACAAAACGCTAGGGCTGCTGGGCACCGAAGGCCGAGAGTGCC 

TAAGGCGCGGGCGGGCCCTGGGTCGCTGGGAACTCCGCAGCTGCCGCCGGCTCTGCGGGGACT 

GCGGGCTGGCGGTGGAGGAGCGCCGGGCCGAGACCGTGTCCAGCTGCAACTGCAAGTTCCACT 

GGTGCTGTGCAGTCCGCTGCGAGCAGTGCCGCCGGAGGGTCACCAAGTACTTCTGTAGCCGCGC 

AGAGCGGCCGCGGGGGGGCGCTGCGCACAAACCCGGGAGAAAACCCTAAGGGTTTCCTCTGCC 

CCCTCCITTTCCCACTGGTTCTTGGCTTCCTTTAGAGACCCCGGTAATTGTGGAACCTAGGGAAT 

GGGGAACCCGCTCTCCCAGACCTAGGGATCCTGAAAGGGAAAAACTGCAATTTCTCCAAAGCT 

TGCCACTTTCCAGCCTGTTTCCCCAATTCCTCTGTGCTCrCCTAAAGCTCTGTCTGAATCCTCGC 

AGCCACACCTAGGTCTGAAAACTCAGGCTTrGAGTTACTGATCITCCITGGATTAGGAAAACAG 

GTGrrCCTCCTCCCCTCTCCTATCAGCCCTAATCTCTGACCTAGCCTATCAACCCTTAGGCGCTG 

GAAAAACCTTCTCATACACGCAGGACCCAGGTTAACTCAAAGCTTTGCCCTTTTGCCCACTGTC 

TGCTACCAGGGGCTCACCCTCTGCTGCACCTCTCTTCTGCACAGCTCCTCCCCTGCTACTGCTGA 

CCAAATTCCCAGGAATCTTGAATGCTTTCTCTCCTCnrCTCCCTTTCCTTTCCCAAAAAA 

AGGAAACTGGCCCCGGAAAAGCATGTCTTTGGGGTTGGTTCCTAGAGGCAGAGGTTGAAGATG 

GAAGAGGGAGCTCTGGAGTGCTAACTTGAACACCAAGGGTGCTACTCATCCCTATGGTATCATA 

TCATGAATGGACTTTACrAGTGGGGCAATGACTTTCCTAGACAATAACCCGAGGGACTCCAGAT 

ACATACCCCGAAGGTCTAGGAAATACGTTAAGGGCAGATTACAGTCATTTCCTACCCTTTAAAG 

GTAACITCreCCTTeTCCTGACCrACTTCCTCGTAGCAACCAACITrACCTCTTCTTCTCCAAAG 

ATCTITGTTCCTCTGAGCCAAGACTGAGGTAAATAAAGCCACTTTCCTCTTCAGATCCTGGTCTG 

CACCTCTAGA 



Figure 26 

GCGGCCGCGTCGACGGAGGGGCTGCAGCTCCGTCAGCCCGGCAGAGCCACCCTGAGCTCGGTG 

AGAGCAAAGCCAGAGCCCCCAGTCCTTTGCTCGCCGGCTTGCTATCTCTCTCGATCACTCCCTCC 

CTTCCTCCCTCCCTTCCTCCCGGCGGCCGCGGCGGCGCTGGGGAAGCGGTGAAGAGGAGTGGCC 

CGGCCCTGGAAGAATGCGGCTCTGACAAGGGGACAGAACCCAGCGCAGTCTCCCCACGGTTTA 

AGCAGCACTAGTGAAGCCCAGGCAACCCAACCGTGCCTGTCTCGGACCCCGCACCCAAACCAC 

TGGAGGTCCTGATCGATCTGCCCACCGGAGCCTCCGGGCTTCGACATGCTGGAGGAGCCCCGGC 

CGCGGCCTCCGCCCTCGGGCCTCGCGGGTCTCCTGTTCCTGGCGTTGTGCAGTCGGGCTCTAAG 

CAATGAGATTCTGGGCCTGAAGTTGCCTGGCGAGCCGCCGCTGACGGCCAACACCGTGTGCTTG 

ACGCTGTCCGGCCTGAGCAAGCGGCAGCTAGACCTGTGCCTGCGCAACCCCGACGTGACGGCG 

TCCGCGCTTCAGGGTCTGCACATCGCGGTCCACGAGTGTCAGCACCAGCTGCGCGACCAGCGCT 

GGAACTGCTCCGCGCTTGAGGGCGGCGGCCGCCTGCCGCACCACAGCGCCATCCTCAAGCGCG 

GTTTCCGAGAAAGTGCTTTTTCCTTCTCCATGCTGGCTGCTGGGGTCATGCACGCAGTAGCCAC 

GGCCTGCAGCCTGGGCAAGCTGGTGAGCTGTGGCTGTGGCTGGAAGGGCAGTGGTGAGCAGGA 

TCGGCTGAGGGCCAAACTGCTGCAGCTGCAGGCACTGTCCCGAGGCAAGAGTTTCCCCCACTCT 

CTGCCCAGCCCTGGCCCTGGCTCAAGCCCCAGCCCTGGCCCCCAGGACACATGGGAATGGGGT 

GGCTGTAACCATGACATGGACTTTGGAGAGAAGTTCTCTCGGGATTTCTTGGATTCCAGGGAAG 

CTCCCCGGGACATCCAGGCACGAATGCGAATCCACAACAACAGGGTGGGGCGCCAGGTGGTAA 

ctgaaaacctgaagcggaaatgcaagtgtcatggcacatcaggcagctgccagttcaagacat 
GCTGGAGGGCGGCCCCAGAGTTCCGGGCAGTGGGGGCGGCGTTGAGGGAGCGGCTGGGCCGG 

GCCATCTTCATTGATACCCACAACCGCAATTCIGGAGCCTTCCAGCCCCGTCTGCGTCCCCGTCG 

CCTCTCAGGAGAGCTGGTCTACTTTGAGAAGTCTCCTGACTTCTGTGAGCGAGACGCCACTATG 

GGCTCCCCAGGGACAAGGGGCCGGGCCTGCAACAAGACCAGCCGCCTGTTGGATGGCTGTGGC 

AGCCTGTGCTGTGGCCGTGGGCACAACGTGCTCCGGCAGACACGAGTTGAGCGCTGCCATTGCC 

GCTTCCACTGGTGCTGCTATGTGCTGTGTGATGA.GTGCAAGGTTACAGAGTGGGTGAATGTGTG 

TAAGTGAGGGTCAGCCITACCrrGGGGCTGGGGAAGAGGACTGTGTGAGAGGGGCGCCTTTrC 

AGCCCriTGCTCTGATTrCCITCCAAGGTCACTCrTGGTCCCrGGAAGCrrAAAGTATCTAC 

GAAACAGCTTTAGGGGTGGTGGGGGTCAGGTGGACTCTGGGATGTGTAGCCTTCTCCCCAACA 

ATTGGAGGGTCTTGAGGGGAAGCTGCCACCCCTCTTCTGCTCCTTAGACACCTGAATGGACTAA 

GATGAAATGCACTGTATTGCTCCTCCCACTTCTCAACTCCAGAGCCCCTTTAACCCTGATTCATA 

CTCCTTTTGGCTGGGGAGTCCCTATAGTTTCACCACTCCTCrrCCCTTGAGGGATAACCCCAGGCA 

CTGTTTGGAGCCATAAGATCTGTATCTAGAAAGAGATCACCCACTCCTATGTACTATCCCCAAA 

CTCCTITACTGCAGCCTGGGCTCCCTCTTGTGGGATAATGGGAGACAGTGGTAGAGAGGTTTTT 

CTTGGGAAAGAGACAGAGTGCTGAGGGGCACTCTCCGCTGAATCCTCAGAGAGTTGTCTGTCCA 

GGCCCTTAGGGAAGTTGTCTCCTTCCATTCAGATGTTAATGGGGACCCTCCAAAGGAAGGGGTT 

TTCCCATGACTCITGGAGCCTCTrTTTCCTTCTC 

ATACTGAGACTTGTrCTTGGTTCCTGTTTGAAACTAAAATAAATTAAGTTACTGGAAAAAAAAA 
AAAAAAAAA 



Figure 27 

TAACCCGCCGCCTCCGCTCTCCCCGGCTGCAGGCGGCGTGCAGGACCAGCGGCGGCCGTGCAG 

GCGGAGGACTTCGGCGCGGCTCCTCCTGGGTGTGACCCCGGGCGCGCCCGCCGCGCGACGATG 

AGGGCGCGGCCGCAGGTCTGCGAGGCGCTGCTCITCGCCCTGGCGCTCCAGACCGGCGTGTGCT 

ATGGCATCAAGTGGCTGGCGCTGTCCAAGACACCATCGGCCCTGGCACTGAACCAGACGCAAC 

ACTGCAAGCAGCTGGAGGGTCTGGTGTCTGCACAGGTGCAGCTGTGCCGCAGCAACCTGGAGC 

TCATGCACACGGTGGTGCACGCCGCCCGCGAGGTCATGAAGGCCTGTCGCCGGGCCTTTGCCGA 

CATGCGCTGGAACTG<:TCCTCCATTGAGCTCGCCCCCAACrATTrGCTTGACCTGGAGAGAGGG 

ACCCGGGAGTCGGCCTTCGTGTATGCGCTGTCGGCCGCCACCATCAGGGACGCCATCGCCCGGG 

CCTGCACCTCCGGCGACCTGCCCGGCTGCTCCTGCGGCCCCGTCCCAGGTGAGCCACCCGGGCC 

CGGGAACCGCTGGGGAAGATGTGGGGACAACCTCAGCTACGGGCTCCTCATGGGGGCCAAGTT 

TTCCGATGCTCCrATGAAGGTGAAAAAAACAGGATCCCAAGCCAATAAACTGATGCGTCTACA 

CAACAGTGAAGTGGGGAGACAGGCTC'rGCGCGCCTCTCTGGAAATGAAGTGTAAGTGCCATGG 

GGTGTCrGGCTCCTGCTCCATCCGCACGTGCTGGAAGGGGCTGCAGGAGCTGCAGGATGTGGCT 

GCTGACCTCAAGACCCGATACCTGTCGGCCACCAAGGTAGTGCACCGACCCATGGGCACCCGC 

AAGCACCTGGTGCCCAAGGACCTGGATATCCGGCCTGTGAAGGACTGGGAACITGTTTATTTGC 

AGAGCTCACCTGACTTTTGCATGAAGAATGAGAAGGTGGGCTCCCACGGGACACAAGACAGGC 

AGTGCAACAAGACTTCCAACGGAAGCGACAGCTGCXSACCrTATGTGCTGCGGGCGTGGCTACA 

ACCCCTACACAGACCGCGTGGTCGAGCGGTGCCACTGTAAGTACCACTGGTGTTGCTACGTCAC 

CTGCCGCAGGTGTGAGCGTACCGTGGAGCGCTATGTCTGCAAGTGAGGCCCTGCCCTCCGCCCC 

ACGCAGGAGCGAGGACTTTGCTCAAGGACCCTCAGCAACTGGGGCCGGGGGCCTGGAGACACT 

CCATG<jAGCTCTGCTTGTGAATTCCAGATGCCAG^CATGGGAGGCGGCTTGTGCT1TGCCTTCA 

CTTGGAAGCCACCAGGAACAGAAGGTCTGGCCACCCTGGAAGGAGNGCAGGACATCAAAGGA 

AACCGACAAGATrAAAAATAACTTGGCAGCCrGAGNTCrGGAGTGCCGACAGNNTGGTGTAAG 

GAGCGGGGCTTGGGATCGGTGAGACTGATACAGACTTGACCTTrCAGGGCCACAGAGACCAGC 

CTCCG<jGAAGGGGTCTGCCCGCCITCTTCAGAATGTrCTGCG<}GACCCCCn , GGCCCACCCTGGG 

GTCTGAGCCTGCTGGGCCACCACATGGAATCACTAGCT^ 

NTTGCTTTTTCTTCCTTTGGGATGTTGGAAGCTACAGAAATATTTATAAA 

TGGGGTGGCACTTCTCAATTCCTCITTATATATTTTANATATATAAATATATATGTA^ 

ATGATCrCTAATNTAAAACrAGCTTTTTAAGCAGCTGTATGAAATAAATGCTGAGTGAGCCCCA 

GCCCGCCCCTGCAGTTCCCGGCCTCGTCAAGTGAACTCGGCAGACCCTGGGGCTGGCAGAGGG 

AGCTCTCCAGTTTCCGGGCA 
GGCGCGGCAAGATGCTGGATGGGTCCCCGCTGGCGCGCTGGCTGGCCGCGGCCTTCGGGCTGA 

Sctgctotcgc^ 

gaSatcctcS^^ 

ccg^ctgaagctgg^ 

CGCTGGTGGAGGCCGTGAGCATGAGTG 

gaSgcac^ 

OTCCTCTATGC^ 

r?rrr^TGCGGAGACAA^ 

TCAA^AAGGATCTCC^\GCOCGTC 

AAraCTGGGGTGGAGACCACCTGCAAG^ 

TGCTOGCGGCAGTTGGCGCCnTTCCATGAGGTGGGCAAGCATCTGAAGCACAAGTATGAGACG 
GCA^'CAAGGTGGC^AGCACCACCAATGAAGCTGCCGGCGAGGCAGGTGCCATCTCCCCACCA 

cggSc^tgcctcgS^^ 

CTGGATGACT^ 

gc^accgtgagSa^a^gtgcgagagcatctgctgtgksccgcggc 

TCGTGACAAGGCCCTGCCAGTGCCAGGTGCGTTGGTGCTGCTATGTGGAGTGCAGGCAGTGCA 
CG^AGCGTCAGGAGGTCT^ 

ca^gotgcaggcaSgcacacggtgto^ 

CGACCAGCCCAGCTGCGTGGGGTACAGGCATTGCACACAGTGTGAATGGGTCTACA^ 
GGGCTGAGT^ 

TGGGGTGCAGGCATTGCACAGAGCATGAATGGGCCTACACCTGCCAAGGCTGAA 

Sagccagccctgctgcacatggcacaggcattgcacac 

GGGCTGAGGCCCTGGGCCCAGTCAGCCCTGCTGCTCAGAGTGCAGGCATTGCACAT 
GAAGCTC1ACACCTGCA 

GCCATGCATGCTAGTATGAGGGGTCTACACCTGCAAGGACTGAGAGGCTTTT 



Figure 29 

AGCCTGCAAAAACCACAGAGGGCAAAGCCAGAAAGATGGAAAGGCACCCACCCATGCAGCTC 
ACCAOTGCC^GGGAGACCCTCITCACAGGGGCTTCTCAAAAGACCTC 

GCATTGCCTCCTTCGGGGTTCCAGAGAA.GCTGGGCTGCGCCAATTTGCCGCTGAACAGCCGCCA 
G^GGAG^JTGTGCAAGAGGAAACCGTACCTGCTGCCGAGCATCCGAGAGGGCGCCCGGCTGGG 

CATrcAGGAGTGCAGGAGCCAGTTCAGACACGAGAGATGGAACTGCATC 

CACTACCGCCCCGATGGGCGCCAGCCCCCTCnTGGCTACGAGCTGAGCAGCGGCACCAAAGA 
GACAGCATTTAITrATGCTGTGATGGCTGCAGGCCTGGTGCATTCTGTGACCAGGTCATGCAGT 
GCAGGCAACATGACAGAGTGTTCCTGTGACACCACCTTGCAG^ 

gSctgg^ctgggggggctgctccg 

TAGATTTCCCCATCGGAAACACCACGGGCAAAGAAAACAAAGTACTATTAGCA 

ataacaatgaagctggaaggcaggctgtcgccaagttgatgtcagtagactgccgctgccacg 

GAGTTTCCGGCTCCTGTGCTGTGAAAACATGCTGGAAAACCA 

CCATTTGTTGAAGGATAAATATGAAAACAGTATCCAGATATCAGACAAAATAAAGAGGAAAAT 

GCGCAGGAGAGAAAAAGATCAGAGGAAAATACCAATCCATAAGGATGATCTGCT 

TAAGTCTCCCAACTACTGTGTAGAAGATAAGAAACTGGGAATCCCAGGGACACAAGGCAGAG 

ATGCAACCGTACATCAGAGGGTGCAGATGGCTGCAACCTCCTCTGCTGTGGCCGAGGTTACAAC 

ACCCATGTGGTCAGGCACGTGGAGAGGTGTGAGTGTAAGTTCATCTGGTGCTGCT 

GCAGGAGGTGTGAAAGCATGACTGATGTCCACACTTGCAAGTAACCACTCCATCCAGCCTTGG 

GCAAGATGCCTCAGCAATATACAATGGCATTGCAACCAGAGAGGTGCCCATCCCTGTGCAGCG 

CTAGTAAAGTTGACTCTTGCAGTGGAATCCC 
AGTTGAGGGATTGACACAAATGGTCAGGCGGCGGCGGCGGAGAAGGAGGCGGAGGCGCAGGG 

GGGAGCCGAGCCCGCTGGGCTGCGGAGAGTTGCGCTCTCTACGGGGCCGCGGCCACTAGCGCG 

GCGCCGCCAGCCGGGAGCCAGCGAGCCGAGGGCCAGGAAGGCGGGACACGACCCCGGCGCGC 

CCTAGCCACCCGGGTTCTCCCCGCCGCCCGCGCTTCATGAATCGCAAGTTTCCGCGGCGGCGGC 

GGCTGCGGTACGCAGAACAGGAGCCGGGGGAGCGGGCCGAAAGCGGCTTGGGCTCGACGGAG 

GGCACCCGCGCAGAGGTCTCCCTGGCCGCAGGGGGAGCCGCCGCCGGCCGTGCCCCTGGCAGC 

CCCAGCGGAGCGGCGCCAAGAGAGGAGCCGAGAAAGTATGGCTGAGGAGGAGGCGCCTAAGA 

AGTCCCGGGCCGCCGGCGGTGGCGCGAGCTGGGAACTTTGTGCCGGGGCGCTCTCGGCCCGGC 

TGGCGGAGGAGGGCAGCGGGGACGCCGGTGGCCGCCGCCGCCCGCCAGTTGACCCCCGGCGAT 

TGGCGCGCCAGCTGCTGCTGCrGCTTTGGCTGCTGGAGGCTCCGCTGCTGCTGGGGGTCCGGGC 

CCAGGCGGCGGGCCAGGGGCCAGGCCAGGGGCCCGGGCCGGGGCAGCAACCGCCGCCGCCGC 

CTCAGCAGCAACAGAGCGGGCAGCAGTACAACGGCGAGCGGGGCATCTCCGTCCCGGACCACG 

GCTATTGCCAGCCCATCTCCATCCCGCTGTGCACGGACATCGCGTACAACCAGACCATCATGCC 

CAACCTGCTGGGCCACACGAACCAGGAGGACGCGGGCCTGGAGGTGCACCAGTTCTACCCTCT 

AGTGAAAGTGCAGTGTTCCGCTGAGCTCAAGTTCTTCCTGTGCTCCATGTACGCGCCCGTGTGC 

ACCGTGCTAGAGCAGGCGCTGCCGCCCTGCCGCTCCCTGTGCGAGCGCGCGCGCCAGGGCTGC 

GAGGCGCTCATGAACAAGTTCGGCTTCCAGTGGCCAGACACGCTCAAGTGTGAGAAGTTCCCG 

gtgcacggcgccggcgagctgtgcgtgggccagaacacgtccgacaagggcaccccgacgccc 

tcgctgctrccagagttctggaccagcaaccctcagcacggcggcggagggcaccgtggcggc 

ttcccggggggcgccggcgcgtcggagcgaggcAagttctcctgcccgcgcgccctcaaggtg 

ccctcctacctcaactaccacttcctgggggagaaggactgcggcgcaccttgtgagccgacca 

aggtgtatgggctcatgtacttcgggcccgaggagctgcgcttctcgcgcacctggattggcat 

ttggtcagtgctgtgctgcgcctccacgctcttcacggtgcttacgtacctggtggacatgcgg 

cgcttcagctacccggagcggcccatcatctrcttgtccggctgttacacggccgtggccgtgg 

cctacatcgccggcttcctcctggaagaccgagtggtgtgtaatgacaagttcgccgaggacgg 

ggcacgcactgtggcgcagggcaccaagaagGagggctgcaccatcctcttcatgatgctcta 

ctrcttcagcatggccagctccatctggtgggtgatcctgtcgctcacctggttcctggcggctg 

GCATGAAGTGGGGCCACGAGGCCATCGAAGCCAACTCACAGTATTTTCACCTGGCCGCCTGGG 

ctgtgccggccatcaagaccatcaccatcctggcgctgggccaggtggacggcgatgtgctga 

gcggagtgtgcttcgtggggcttaacaacgtggacgcgctgcgtggcttcgtgctggcgcccct 

cttcgtgtaccrrgtttatcggcacgtccittctgcrggccggcitrgtgtcgctgttccgcatgcg 

caccatcatgaagcacgatggcaccaagaccgagaagctcjgagaagctcatggtgcgcattgg 

cgtcrrcagcgtgctgtacactgtgccagccaccatcgtcatcgcctgctacttctacgagcag 

gccttccgggaccagtgggaacgcagctgggtggcccagagctgcaagagctacgctatcccc 

tgccctcacctccaggcggggggaggcgccccgccgcacccgcccatgagcccggacttcacg 

GTCTTCATGATTAAGTACCrrTATGACGCTGATCGTGGGCATCACGTCGGGCTTCTGGATCTGGTC 

CGGCAAGACCCTCAACTCCTGGAGGAAGTTCTACACGAGGCTCACCAACAGCAAACAAGGGGA 

GACTACAGTCTGAGACCCGGGGCTCAGCCCATGCCCAGGCCTCGGCCGGGGCGCAGCGATCCC 

CCAAAGCCAGCGCCGTGGAGTTCGTGCCAATCCTGACATCTCGAGGTTTCCTCACTAGACAACT 

CTCTTTCGCAGGCTCCTTTGAACAACrCAGCTCCTGCAAAAGCTTCCGTCCCTGAGGCAAAAGG 

ACACGAGGGCCCGACTGCCAGAGGGAGGATGGACAGACCTCTTGCCCTCACACTCTGGTACCA 

GGACTGTTCGCTTTTATGATTGTAAATAGCCTGTGTAAGATTTTTGTAAGTATATT^ 

ATGACGACCGATCACGCGTTTTTCTTTTTCAAAAGTTTTTAATTA 

TGAGGCTITrCCTTCTTGCCCTTTTCGGAGTATTGCAAAGGAGCTAAAACTGGTGTGCAACCGC 

ACAGCGCTCCTGGTCGTCCTCGCGCGCCTCTCCCTACCACGGGTGCTCGGGACGGCTGGGCGCC 

AGCTCCGGGGCGAGTTCAGCACTGCGGGGTGCGACTAGGGCTGCGCTGCCAGGGTCACTTCCC 

GCCTCCTCCTTTTGCCCCCTCCCCCTCCITCTGTCCCCTCCCnTCTITCCTGG 

GGTCTrAAGGTACAGAACTCCACAAACCTTCCAAATCTGGAGGAGGGCCCCCAtACATTACAAT 

TCCTCCCTTGCTCGGCGGTGGATTGCGAAGGCCCGTCCCTTCGACTTCCTGAAGCTGGATTTTTA 

ACTGTCCAGAACTTTCCTCCAACTTCATGGGGGCCCACGGGTGTGGGCGCTGGCAGTCTCAGCC 

TCCCTCCACGGTCACCTTCAACGCCCAGACACTCCCrTCTCCCACCTTAGTTGGTTACAGGGTGA 

GTGAGATAACCAATGCCAAACTTTTTGAAGTCTAATTTTTGAGGGGTGAGCTCATTTCATTCT 

AGTGTCTAAAACCTGGTATGGGTTTGGCCAGCGTCATGGAAAGATGTGGTTACTGAGATTTGGG 

AAGAAGCATGAAGCTTTGTGTGGGTTGGAAGAGACTGAAGATATGGGTTATAAAATGTT AATT 

CTAATTGCATACGGATGCCTGGCAACCTTGCCTTTGAGAATGAGACAGCCTGCGCTTAGATTTT 

ACCGGTCTGTAAAATGGAAATGTTGAGGTCACCTGGAAAGCTTTGTTAAGGAGTTGATGTTTGC 
TTTCOTAACAAGACAGCAAAACGTAAACAGAAATTGAAAACTTGAAGGATATTTCAGTGTCAT 

GGACTTCCTCAAAATGAAGTGCTATTTTCTTATTITTAATCAAATAACTAGACATATATCAG^ 

ACTITAAAATGTAAAAGTrGTACACTTTCAACATTTTATTACGATTATTATTCAG^ 

TGAGGGGGGAACAATTCACACCACCAATAATAACCTGGTAAGATTTCAGGAGGTAAAGAAGGT 

GGAATAATTGACGGGGAGATAGCGCCTGAAATAAACAAAATATGGGCATGfCATGCTAAAGGG 

AAAATGTGTGCAGGTCTACTGCATTAAATCCTGTGTGCTCCTCTTTTGGATTTACAGAAATGTGT 

CAAATGTAAATCTTTCAAAGCCATTTAAAAATATTCACTTTAGTTCTCTGTGAAGAAGAGGAGA 

AAAGCAATCCTCCTGATTGTATTGTTITAAACTTrAAGAATTTATCAAAATGCCGGTACTTAGG 

ACCTAAATITATCrATGTCrGTCATACGCTAAAATGATATTGGTCTTrGAATTTGGTATACATrr 

ATrCTGTrCACTATCACAAAATCATCTATATTTATAGAGGAATAGAAGTTTATATATA^TAATA 

CCATATTITrAATTTCACAAATAAAAAATTCAAAGTTTTGTACAAAATTATATC 

TGAAAATAATAGAGCTTGAGCTGTCTGAACTA 

ACAGTGTAAAAATTCA 



Figure 31 

CGAGTAAAGTTTGCAAAGAGGCGCGGGAGGCGGCAGCCGCAGCGAGGAGGCGGCGGGGAAGA 

AGCGCAGTCTCCGGGTTGGGGGCGGGGGCGGGGGGGGCGCCAAGGAGCCGGGTGGGGGGCGG 

CGGCCAGCATGCGGCCCCGCAGCGCCCTGCGCCGCCTGCTGCTGCCGCTGCTGCTGCTGCCCGC 

CGCCGGGCCGGCCCAGTTCCACGGGGAGAAGGGCATCTCCATCCCGGACCACGGCTTCTGCCA 

GCCCATCTCCATCCCGCTGTGCACGGACATCGCCTACAACCAGACCATCATGCCCAACCTTCTG 

GGCCACACGAACCAGGAGGACGCAGGCCTAGAGGTGCACCAGTTCTATCCGCTGGTGAAGGTG 

CAGTGCTCGCCCGAACTGCGCTTCTTCCTGTGCTCCATGTACGCACCCGTGTGCACCGTGCTGG 

AACAGGCCATCCCGCCGTGCCGCTCTATCTGTGAGCGCGCGCGCCAGGGCTGCGAAGCCCTCAT 

GAACAAGTTCGGTTTTCAGTGGCCCGAGCGCCTGCGCTGCGAGCACTTGCCGCGCCACGGCGCC 

gagcagatctgcgtcggccagaaccactccgaggacggAgctcccgcgctactcaccaccgcg 
ccgccgccgggactgcagccgggtgccgggggcaccccgggtggcccgggcggcggcggcgct 

CCCCCGCGCTACGCCACGCTGGAGCACCCCTTCCACTGCCCGCGCGTCCTCAAGGTGCCATCCT 

ATCTCAGCTACAAGTTTCTGGGCGAGCGTGATTGTGCTGCGCCCTGCGAACCTGCGCGGCCCGA 

TGGTTCCATGTTCITCTCACAGGAGGAGACGCGTTTCGCGCGGCTCTGGATCCTCACCTGGTCG 

GTGCTGTGCTGCGCTTCCACCITCTrCACTGTCACCACGTACTTGGTAGACATGCAGCGCTTCCG 

CTACCCAGAGCGGCCTATCATTTTTCTGTCGGGCTGCTACACCATGGTGTCGGTGGCCTACATC 

GCGGGCTTCGTGCTCCAGGAGCGCGTGGTGTGCAACGAGCGCTTCTCCGAGGACGGTTACCGC 

ACGGTGGTGCAGGGCACCAAGAAGGAGGGCTGCACCATCCTCTTCATGATGCTCTACTTCTTCA 

GCATGGCCAGCTCCATCTGGTGGGTCATCCTGTCGCTCACCTGGTTGCTGGCAGCCGGCATGAA 

GTGGGGCCACGAGGCCATCGAGGCCAACTCTCAGTACTTCCACCTGGCCGCCTGGGCCGTGCCG 

GCCGTCAAGACCATCACCATCCTGGCCATGGGCCAGATCGACGGCGACCTGCTGAGCGGCGTG 

TGCTTCGTAGGCCTCAACAGCCTGGACCCGCTGCGGGGCTTCGTGCTAGCGCCGCTCTTCGTGT 

ACCrGTTCATCGGCACGTCCTTCCTCCTGGCCGGCTTCGTGTCGCTCTTCCGCATCCGCACCATC 

ATGAAGCACGACGGCACCAAGACCGAAAAGCTGGAGCGGCTCATGGTGCGCATCGGCGTCTTC 

TCCGTGCTCTACACAGTGCCCGCCACCATCGTCATCGCTTGCTACTTCTACGAGCAGGCCTTCCG 

CGAGCACTGGGAGCGCTCGTGGGTGAGCCAGCACTGCAAGAGCCTGGCCATCCCGTGCCCGGC 

GCACTACACGCCGCGCATGTCGCCCGACTTCAGGGTCTACATGATCAAATACCTCATGACGCTC 

ATCGTGGGCATCACGTCGGGCTTCTGGATCTGGTCGGGCAAGACGCTGCACTCGTGGAGGAAG 

TTCTACACTCGCCTCACCAACAGCCGACACGGTGAGACCACCGTGTGAGGGACGCCCCCAGGC 

CGGAACCGCGCGGCGCTTTCCrCCGCCCGGGGTGGGGCCCCTACAGACrCCGTATTTTATTTTTT 

TAAATAAAAAACGATCGAAACCATTTCACTTTTAGGTTGCTTTTTAA 

AACACCCCC 

Figure 32 

GCCGCTCCGGGTACCTGAGGGACGCGCGGCCGCCCGCGGCAGGCGGTGCAGCCCCCCCCCACC 

CCTTGGAGCCAGGCGCCGGGGTCTGAGGATAGCATTTCTCAAGACCTGACTTATGGAGCACTTG 

TAACCTGAGATATTTCAGTTGAAGGAAGAAATAGCTCTTCTCCrAAGATGGAATCTGTGGTTTG 
GGAATGTGGTTGATCAACTTGATATGTTGGCCAAATGTGCCCCATGTAATAAAATGAAAAGAA 

GAGACAAGATGATGTCATTTTCCCATATTGTGAAACCAAAAACAAACGCCITTTGTGAGACCAA 

GCrAACAAACCTCTGACGGTGCGAAGAGTATTTAACrGTTTGAAGAATTTAACAGTAAGATACA 

GAAGAAGTACCTTCGAGCTGAGACCTGCAGGTGTATAAATATCTAAAATACATATTGAATAGG 

CCTGATCATCTGAATCTCCTTCAGACCCAGGAAGGATGGCTATGACITGGATTGTCrTCTCTCTT 

TGGCCCTTGACTGTGTrCATGGGGCATATAGGTGGGCACAGTTTGTTTTCTTGTGAACCTATTAC 

CTTGAGGATGTGCCAAGATTTGCCTTATAATACTACCITCATGCCTAATCrTCTGAATCATTATG 

ACCAACAGACAGCAGarTTGGCAATGGAGCCATTCCACCCTATGGTGAATCTGGATTGTTCTCG 

GGATTTCCGGCCTTTTCITTGTGCACTCrACGCTCCTATTTGTATGGAATATGGACGTGTCACAC 

TTCCCTGTCGTAGGCTGTGTCAGCGGGCTTACAGTGAGTGTTCGAAGCTCATGGAGATGTTTGG 

TGTTCCTTGGCCTGAAGATATGGAATGCAGTAGGTTCCCAGATTGTGATGAGCCATATCCTCGA 

CTTGTGGATCTGAATTTAGCTGGAGAACCAACTGAAGGAGCCCCAGTGGCAGTGCAGAGAGAC 

TATGGTTTTTGGTGTCCCCGAGAGTTAAAAATTGATCCrGATCrGGGTTATTCTTTTCTGCATGT 

GCGTGATTGTTCACCTCCITGTCCAAATATGTACrTCAGAAGAGAAG AACTGTC ATTTGCTCGCT 

ATITCATAGGATTGATTrCAATCATTTGCCrCTCGGCCACATTGTTTAClillliAACllilllGA 

TTGATGTCACAAGATTCCGTTATCCTGAAAGGCCrATTATATTTTATGCAGTCTGCTACATGATG 

GTATCCTTAATTTTCTTCATTGGATTITTGCTTGAAGATCGAGTAGCCrGCAATGCATCC 

TGCACAATATAAGGCTTCCACAGTGACACAAGGATCTCATAATAAAGCCTGTACCATGCITTTT 

ATGATACTCTATTTTTTTACTATGGCTGGCAGTGTATGGTGGGTAATTCTTACCATCACATGGTT 

TTTAGCAGCTGTGCCAAAGTGGGGTAGTGAAGCTATTGAGAAGAAAGCATTGCTGTTTCACGCC 

AGTGCATGGGGGATCCCCGGAACTCTAACCATCATCCITTTAGCGATGAATAAAATTGAAGGTG 

ACAATATTAGTGGCGTGTGTTTTGTTGGCCTCTACGA 

GCTCCCCTCTGCCTGTATGTGGTAGTTGGGGTTTCrCrCCTCnTAGCTGGCATTATATCCCTAAA 
CAGAGTTCGAATTGAGATTCCATTAGAAAAGGAGAACCAAGATAAATTAGTGAAGTTTA TGAT 
CCGGATCGGTGTTTTCAGCATTCTTTATCTCGTACCACT 

TGAGCAAGCTTACCGGGGCATCTGGGAAACAACGTGGATACAAGAACGC TGCA GAGAATATCA 

CATTCCATGTCCATATCAGGTTACTCAAATGAGTCGTCCAGACITGATTCTCITTCTGATGAAAT 

ACCTGATGGCrCTCATAGTTGGCATTCCCTCrGTATTTTGGGTTGGAAGCAAAAAGACATGCTTT 

GAATGGGCCAGTTTTTTTCATGGTCGTAGGAAAAAAGAGATAGTGAATGAGAGCCGACAGGTA 

CTCCAGGAACCTGATTTTGCTCAGTCTCTCCrGAGGGATCCAAATACTCCTATCATAAGAAAGT 

CAAGGGGAACTTCCACTCAAGGAACATCCACCCATGCTTCTTCAAGTCAGGTGGCTATGGTGGA 

TGATCAAAGAAGCAAAGCAGGAAGCATCCACAGCAAAGTGAGCAGCTACCACGGCAGCCTCC 

ACAGATCACGTGATGGCAGGTACACGCCCTGCAGTTACAGAGGAATGGAGGAGAGACTACCTC 

ATGGCAGCATGTGACGACTAACAGATCACTCCAGGCATAGTAGTTCTCATCGGCTCAATGAACA 

GTCACGACATAGCAGCATCAGAGATCTCAGTAATAATCCCATGACTCATATCACACATGGCACC 

AGCATGAATCGGGTT*ATTGAAGAAGATGGAACCAGTGCTTAAT^ 

TCTTGTGCTGTTTAAAAAGCAGATTTTATTCT 

GTTAACATGCTTTCAGTCAAGTACAGATTGTGTCCACTGGAAAGGTAAATGATTGClll'l'l'lATA 
TTGCATCAAACITGGAACATCAAGGCATCCAAAACACrAAGAATTCTATCATCACAAAAATAAT 
TCGTCTTTCTAGGTTATGAAGAGATAATTATTTGTCTGGTAAGCATTTTTATAAACCCACT 
TTATATTTAGAAAAATCCrAAATGTGTGGTGACT^ 

TAGTTGTGAGATAACATTCTGGTAGCTCAGTTAATAAAACAATTTCAGAATTAAAGAAATTTTC 

TATGCAAGGTTTACITCrCAGATGAACAGTAGGACTTTGTAGTTTTATTTCCACT 

AAGAACTGTGTTTTTAAACTGTAGGAGAATTTAATAAATCAGCAAGGGTATTT^ 

ATAAAAGTGCAACAGAAGAATTTGATTAGTCTATGAAAGGTTCTCrrAAAATTCTATCGAAATA 

ATCTTCATGCAGAGATATTCAGGGTTTGGATTAGCAGTGGAATAAAGAGATGGGCATTGTTTCC 

CCnATAATTGTGCTGTTTTTATAACTTTTC 

ATCCATATGCATGATGGAAAAATTTTAATTTGTAGCCATC^ 

ATAGAGAACTTAATGTTCAAAATTTGCTTTGTGGAGGCATGTAATAAGATAAACATCATACATT 
ATAAGGTAACCACAATTACAAAATGGCAAAACA 

Figure 33 

GCTGCGCAGCGCTGGCTGCTGGCTGGCCTCGCGGAGACGCCGAACGGAGGCGGCCGGCGCCGG 
CTTGTGGGCTCGCCGCCTGCAGCCATGACCCTCGCAGCCTGTCCCTCGGCCTCGGCCCGGGACG 
TCTAAAATCCCACACAGTCGCGCGCAGCTGCTGGAGAGCCGGCCGCTGCCCCCTCGTCGCCGCA 
TCACACTCCCGTCCCGGGAGCTGGGAGCAGCGCGGGCAGCCGGCGCCCCCGTGCAAACTGGGG 
GTGTCTGCCAGAGCAGCCCCAGCCGCTGCCGCTGCTACCCCCGATGCTGGCCATGGCCTGGCGG 
GGCGCAGGGCCGAGCGTCCCGGGGGCGCCCGGGGGCGTCGGTCTCAGTCTGGGGTTGCTCCTG 
CAGTTGCTGCTGCTCCTGGGGCCGGCGCGGGGCTTCGGGGACGAGGAAGAGCGGCGCTGCGAC 
CCCATCCGCATCTCCATGTGCCAGAACCTCGGCTACAACGTGACCAAGATGCCCAACCTGGTTG 
GGCACGAGCTGCAGACGGACGCCGAGCTGCAGCTGACAACTTTCACACCGCTCATCCAGTACG 
GCTGCTCCAGCCAGCTGCAGTTCITCCTTTGTTCTGtTTATGTGCCAATGTGCACAGAGAAGATC 
AACATCCCCATTGGCCCATGCGGCGGCATGTGTCTTTCAGTCAAGAGACGCTGTGAACCCGTCC 
TGAAGGAATTTGGATTTGCCTGGCCAGAGAGTCTGAACTGCAGCAAATTCCCACCACAGAACG 
ACCACAACCACATGTGCATGGAAGGGCCAGGTGATGAAGAGGTGCCCTTACCTCACAAAACCC 
CCATCCAGCCTGGGGAAGAGTGTCACTCTGTGGGAACCAATTCTGATCAGTACATCTGGGTGAA 
AAGGAGCCTGAACTGTGTGCTCAAGTGTGGCTATGATGCTGGCTTATACAGCCGCTCAGCCAAG 
GAGTTCACTGATATCTGGATGGCTGTGTGGGCCAGCCTGTGTTTCATCTCCACTGCCTTCACAGT 
ACTGACCTTCCTGATCGATTCITCTAGGTTTTCCTACCCrGAGCGCCCCATCATATTTCTCAGTA 
TGTGCrATAATATTTATAGCATTGCTTATATTGTCAGGCTGACTGTAGGCCGGGAAAGGATATC 
CTGTGATTTTGAAGAGGCAGCAGAACCTGTTCTCATCCAAGAAGGACTTAAGAACACAGGATG 
TGCAATAATTTTCTTGCTGATGTACTTTTTTGGAATGGCCAGCTCCA 

CACTCACTTGGTTTTTGGCAGCAGGACTCAAATGGGGTCATGAAGCCATTGAAATGCACAGCTC 

TTATTTCCACATTGCAGCCTGGGCCATCCCCGCAGTGAAAACCATTGTCATCTTGATTATGAGA 

CTGGTGGATGCAGATGAACTGACTGGCTTGTGCTATGTTGGAAACCAAAATCTCGATGCCCTCA 

CCGGGTTCGTGGTGGCTCCCCTCITrACTTATITGGTCATTGGAACnTTGTTCATTGCTGCAGOT 

TTGGTGGCGTTGTTCAAAATTCGGTCAAATCITCAAAAGGATGGGACAAAGACAGACAAGTTA 

GAAAGACTGATGGTCAAGATTGGGGTGTTCTCAGTACTGTACACAGTTCCTGCAACGTGTGTGA 

TTGCCTGTTATTTTTATGAAATCTCCAACTGGGCACTTTTTCGGTATTCrGCAGATGAT^ 

ATGGCTGTTGAAATGTTGAAAACITTTATGTCTTTGTTGGTGGGCATCACTTCAGGCATGTGGAT 

TTGGTCTGCCAAAAGTCrTCACACGTGGCAGAAGTGTTCCAACAGATTGGTGAATTCTGGAAAG 

GTAAAGAGAGAGAAGAGAGGAAATGGTTGGGTGAAGCCTGGAAAAGGCAGTG AGAC TGTGGT 

ATAAGGCTAGTCAGCCTCCATGC1TTCTTCATTTTGAAGGGGGGAATGCCAGCATTTTGGAGGA 

AATTCTACTAAAAGTTTTATGCAGTGAATCTCAGTTTGAACAAACTAGCAACAATTAAGTGACC 

CCCGTCAACCCACTGCCTCCCACCCCGACCCCAGCATCAAAAAACCAATGATTTTGCTGCAGAC 

TTTGGAATGATCCAAAATGGAAAAGCCAGTTAGAGGCITTCAAAGCTGTGAAAAATCAAAACG 

TTGATCACTTTAGCAGGTTGCAGCTTGGAGCGTGGAGGTCCTGCCTAGATTCCAGGAAGTCCAG 

GGCGATACrGTTTTCCCCTGCAGGGTGGGATTTGAGCTGTGAGTTGGTAACTAGCAGGGAGAAA 

TATTAACTTTTTTAACCCTTTACCATTTTAAATACTAACTGGGT 

TATAAACACTGGAAACGCrGGG-TTCAGAAAAGTGTTACAAGAGTTTTATAGTTTGGCTGATGTA 

ACATAAACATCTTCTGTGGTGCGCTGTCTGCTGTTTAGAACTTTGTGGACTGCACTCCCAAGAA 

GTGGTGTTAGAATCTTTCAGTGCCITrGTCATAAAACAGTTATTTGAACAAACAAAAGTACTGT 

ACTCACACACATAAGGTATCCAGTGGATTTTTCTTCTCTC 

CTCTTCTrGGCTGCTGCTGTTTTCTTCATTTTATGTTAATGACTCAAAAAA 

TTTTTGTACTGCAGCATGCTTAAAGAGGGGAAAAGGAAGGGTGATTCACT1TCTGACAATCACT 

TAATTCAGAGGAAAATGAGATTTACTAAGTTGACTTACCTGACGGACCCCAGAGACCTATTGCA 

TTGAGCAGTGGGGACITAATATATTTTACTTGTGTGATTGCATCTATGCAGACGCCA GTCTG GA 

AGAGCTGAAATGTTAAGTTTCITGGCAACTTTGCATTCACACAGATTAGCTGTGTAAIUlll'GTG 

TGTCAATTACAATTAAAAGCACATTGTTGGACCATGACATAGTATACTCAACTGACTTTAAAAC 

TATGGTCAACTTCAACTTGCATTCTCAGAATGATAGTGCCTITAAAATTTTTT^ 

CATAAGAATGTTATCAGAATCTGGTCTACTTAGGACAATGGAGACTTTTTCAGTTTTATAAAGG 

GAACTGAGGACAGCTAATCCAACTACTTGGTGCTGTAATTGTTTCCTAGTAATTGGCAAAGGCT 

CCTrGTAAGATTTCACTGGAGGCAGTGTGGCCrGGAGTATTTATATGGTGCTTAATGAATCTCC 

AGAATGCCAGCCAGAAGCCTGATTGGTTAGTAGGGAATAAAGTGTAGACCAT ATGA AATGAAC 

TGCAAACTCTAATAGCCCAGGTCTTAATTGCCTTTAGCAGAGGTATCCAAAGCTTTTAAAATTT 

ATGCATACGTTCITCACAAGGGGGTACCCCCAGCAGCCTCTCGAAAATTGCACTTCTCTTAAAA 

CTGTAACTGGCCTTTCTCTTACCITGCCTTAGGCCTTCTAATCATGAGATCTTGGGGACAAATTG 

ACTATGTCACAGGTTGCTCTCCTTGTAACTCATACCTGTCrGCTTCAGCAACTGCTTTGCAATGA 

CATTTATITATTAATTCATGCCTTAAAAAAAT^ 

TTCAATCACACTTTGTGGAAAAACATTTCCAGGGACTCAAAATTCCAAAAAGGTGGTCAAATTC 
TGGAAGTAAGCATTTCCTCTITTTTAAAAATTTGGTTTGAGCCTTATGCCCATAG 
rrrTTTPTTCTTTCCTTTTTGTTTTTGTGTGGTTCTTG^ 



G 
A 

ACGCAC 



ATTG ATCCCC AGAAGAACCJ A 11 1 lAAUlLtAitmAiAbi^Ai X^^VWA 
ATGTGAACTCCCTGTGCTGCTTGTAGACAGTTCCCATAACTGTCCACGGCCCTGGAGC.^^^^ 

?GGCCCAGGCAGGGGAAGGAAGACCAGGCGGTAGGGGACTGGTCITGCT 
GTTTGT^^TCCA 

aaSca^cTtg^a^ 

AAVJUL,avju ^J;^yf\ r ^ rr ^^ . . p.-rrrrrr; a r.rr A rrrGTTTTGACAATGCT 



mGGGTGAATC^GCACCTG 
T£tGGCCA™ 

tSagaca^^ 

AGTGACCTTCAATCAACTCCATCACTTC^ 

GATGGGGACGCACCCAGGTCTGCTGAACTTTACTGTTCCTCT 
G^TCCC^ 

CCTGGGTCTCTCTACITITAACACATCTCTCATC^ 
AGGTGGTTTTAACAGAAAGCATCAGCTCTGCTrCGTGACAGTCTCT 

gactatgaSagtaggccacaaggacatgggcccacacatctgctt^ 

GGGCTTGGGGTATTCCATGTGACnTGTATAGGTATATTTGAGGACAGCATCTTGCTAGAGAAA^ 

GGTCAGGGTTGTITTTCTITCTCTGAAACCTACAGTAAATGGGTATGATTGTAGC^ 

ATCCCITGGCCTCCAGAGATTAAACATGGTGCAATGGCACCT 

GA^CCmCTCCTGCTrCATATAGGCCAAACCTCAGGGCAAGGGAACATGGGGGTAGAGTGGT 
GCTGGCCAGAACCATCTGCTTGAGCTACTTGGTTGATTCATATCCTCITrC 

AmCCTGACTCTGAGACT^^ 
GTGACATTTTTTT^ 

GAAACTCAGGTTCCGGGAAGCTTGTGTGTCTAGATACTGAATTGAGATTCTGTTCAGCA 
m . ^^-r a a oTr^PTnr.rTrprfTr a Tr-r^P ATfiGTC ACTGAATTAAATGCTTATTGTAI 



CTCTC' 



TGAGAACCAAGATGGGACCTGAGGACACAAAGATGAGCTCAACAGTCTCAGCCCTAGAGGAAT 
AGACTCAGGGATTTCACCAGGTCGGTGCAGTATTTGATTTCTGGTGAGGTGACCACAGCTGCAG 
T^GGAAGGGAGCCATTGAGCACAGACTTTGGAAGGAACCl^^ 
^GTTTGTTTGTTTGTTTGAGACAGGGTCTTGCTCTGCTACCCAGGCT 

TCTTGGCTCAGTGCAACCTCTGCCTCCTGGGTTCAAGTGATTCTCCTGCCACAGCCTCCTGAGGA 

GCTGGGACTACAGGTGCGTGCTACCACGCCCAGCTACTTCTGTATTTTTAGTAGA 

TCACTGTGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCATGATCTGCCCGCCTCAGCCT 

AGTGCTGGGATTACAAGTGTGAGCCACCACACCTGGCCTGGAAGGAACCTCTTAAAATCAG'm 

ACGTCTTGTATTlTGTrCTGTGATGGAGGACACTGGAGAGAGTTGCTATTCCAGTCAATCATGTC 

GAGTCACTGGACTCTGAAAATCCTATTGGTTCCTTTATTTTATTTGAGTT^ 

^TTTGTATrATGTCTGGCAAATGACCrrGGGTTATCACTTTTCCTCCAGGGTTAGATCATAGATC 
TTGGAAACTCCTTAGAGAGCATTTTGCTCCTACCAAGGATCAGATACTGGAGCCCCACATAATA 
GATITCATrrCACTCTAGCCTACATAGAGCTTTCTGTTGCTGTCTCTTGCCATGCACTrG^GG 
TGATTACACACTTGACAGTACCAGGAGACAAATGACTTACAGATCCCCCGACATGCCTCTTCCC 

CTTGGCAAGCTCAGTTGCCCTGATAGTAGCATGTTTCTC 

CmGCATCAGCCAATTCCCAGAATTTCCCCAGGCAATTTGTAGAGGACCTTTTTGGGGTC^ 

ATGAGCCATGTCCTCAAAGCTTTTAAACCTCCTTGCTCTCCTACAATATTCAGTACATGACCACT 

GTCATCCTAGAAGGCTTCTGAAAAGAGGGGCAAGAGCCACTCTGCGCCACAAAGGTTGGATCC 

ATCTTCTCTCCGAGGTTGTGAAAGTTTTCAaATTGTACTAATAGGCTGGGGCCCTGACTTGGCTG 

TGGGCTITGGGAGGGGTAAGCTGCTTTCTAGATCTCTCCCAGTGAGGCATGGAGGTGTT^GA 

ATITTGTCTACCTCACAGGGATGTTGTGAGGCrrTGAAAAGGTCAAAAAATGATGGCCCCTTGAG 

CTCT1TGTAAGAAAGGTAGATGAAATATCGGATGTAATCTGAAAAAAAGATAAAATGTGACTT 

CCCCTGCTCTGTGCAGCAGTCGGGCTGGATGCTCTGTGGCimTCTTGGGTCCTCATGCCACCCC 

ACAGCTCCAGGAACCTrGAAGCCAATCTGGGGACTTTCAGATGTTTGACAAAGAGGTACCAGG 

CAAACTTCCTGCTACACATGCCCTGAATGAATTGCTAAATTTCAAAGGAAATGGACCCTGCTTT 

TAAGGATGTACAAAAGTATGTCTGCATCGATGTCTGTACTGTAAATTTCTAATTTATCACTGTAC 
AAAGAAAACCCCTTGCTATTTAATTTTGTATTAAAGGAAA 
AA 

Figure 34 

ACCCAGGGACGGAGGACCCAGGCTGGCTTGGGGACTGTCTGCTCTTCTCGGCGGGAGCCGTGG 

AGAGTCCTTTCCCTGGAATCCGAGCCCTAACCGTCTCTCCCCAGCCCTATCCGGCGAGGAGCGG 

AGCGCTGCCAGCGGAGGCAGCGCCTTCCCGAAGCAGTTTATCTTTGGACGGTlTrCTTTAAAGG 

AAAAACGAACCAACAGGTTGCCAGCCCCGGCGCCACACACGAGACGCCGGAGGGAGAAGCCC 

CGGCCCGGATTCCTCTGCCTGTGTGCGTCCCTCGCGGGCTGCTGGAGGCGAGGGGAGGGAGGG 

GGCGATGGCTCGGCCTGACCCATCCGCGCCGCCCTCGCTGTTGCTGCTGCTCCTGGCGCAGCTG 

GTGGGCCGGGCGGCCGCCGCGTCCAAGGCCCCGGTGTGCCAGGAAATCACGGTGCCCATGTGC 

CGCGGCATCGGCTACAACCTGACGCACATGCCCAACCAGTTCAACCACGACACGCAGGACGAG 

GCGGGCCTGGAGGTGCACCAGTTCTGGCCGCTGGTGGAGATCCAATGCTCGCCGGACCTGCGCT 

TCTTCCTATGCACTATGTACACGCCCATCTGTCTGCCCGACTACCACAAGCCGCTGCCGCCCTGC 

CGCTCGGTGTGCGAGCGCGCCAAGGCCGGCTGCTCGCCGCTGATGCGCCAGTACGGCTTCGCCT 

GGCCCGAGCGCATGAGCTGCGACCGCCTCCCGGTGCTGGGCCGCGACGCCGAGGTCCTCTGCA 

TGGATTACAACCGCAGCGAGGCCACCACGGCGCCCCCCAGGCCTTTCCCAGCCAAGCCCACCCT 

TCCAGGCCCGCCAGGGGCGCCGGCCTCGGGGGGCGAATGCCCCGCTGGGGGCCCGTTCGTGTG 

CAAGTGTCGCGAGCCCTTCGTGCCCATTCTGAAGGAGTCACACCCGCTCTACAACAAGGTGCGG 

ACGGGCCAGGTGCCCAACTGCGCGGTACCCTGCTACCAGCCGtCCTTCAGTGCCGACGAGCGC 

ACGTTCGCCACCTTCTGGATAGGCCTGTGGTCGGTGCTGTGCTTCATCTCCACGTCCACCACAGT 

GGCCACCTTCCTCATCGACATGGACACGTTCCGCTATCCTGAGCGCCCCATCATCTTCCTGTCAG 

CCTGCTACCTGTGCGTGTCGCTGGGCTTCCTGGTGCGTCTGGTCGTGGGCCATGCCAGCGTGGC 

CTGCAGCCGCGAGCACAACCACATCCACTACGAGACCACGGGCCCTGCACTGTGCACCATCGT 

CTTCCTCCTGGTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTCATCCTGTCGCTCACCT 

GGTTCCTGGCCGCCGCGATGAAGTGGGGCAACGAGGCCATCGCGGGCTACGGCCAGTACTTCC 

ACCTGGCTGCGTGGCTCATCCCCAGCGTCAAGTCCATCACGGCACTGGCGCTGAGCTCCGTGGA 

CGGGGACCCAGTGGCCGGCATCTGCTACGTGGGCAACCAGAACCTGAACTCGCTGCGGCGCTT 

CGTGCTGGGCCCGCTGGTGCTCTACCTGCTGGTGGGCACGCTCTTCCTGCTGGCGGGCTTCGTGT 

CGCTGTTCCGCATCCGCAGCGTCATCAAGCAGGGCGGCACCAAGACGGACAAGCTGGAGAAGC 

TCATGATCCGCATCGGCATCTTCACGCTGCTCTACACGGTCCCCGCCAGCATTGTGGTGGCCTG 

CTACCTGTACGAGCAGCACTACCGCGAGAGCTGGGAGGCGGCGCTCACCTGCGCCTGCCCpGG 

CCACGACACCGGCCAGCCGCGCGCCAAGCCCGAGTACTGGGTGCTCATGCTCAAGTACTTCATG 

tgcctggtggtgggcAtcacgtcgggcgtctggatctggtcgggcaagacggtggagtcgtgG
cggcgtttcaccagccgctgctgctgccgcccgcggcgcggccacaagagcgggggcgccatg 
gccgcaggggactaccccgaggcgagcgccgcgctcacaggcaggaccgggccgccgggcccc 

GCCGCCACCTACCACAAGCAGGTGTCCCTGTCGCACGTGTAGGAGGCTGCGGCCGAGGGACTC 

GGCCGGAGAGCTGAGGGGAGGGGGGCGTTTTGTTTGGTAGTTTTGCCAAGGTCACTTCCGTTTA 

CCTTCATGGTGCTGTTGCCCCCTCCCGCGGCGACITGGAGAGAGGGAAGAGGGGCGTTTTCGAG 

GAAGAACCTGTCCCAGGTCTTCTCCAAGGGGCCCAGCTCACGTGTATTCTATTTTGCGTTTCTTA 

CCTGCCTTCTTTATGGGAACCCTCTITTTAATTTATATGTAT 



Figure 35 

GCAGCTCCAGTCCCGGACGCAACCCCGGAGCCGTCTCAGGTCCCTGGGGGGAACGGTGGGTTA 

GACGGGGACGGGAAGGGACAGCGGCCTTCGACCGCCCCCCGAGTAATTGACCCAGGACTCATT 

TrCAGGAAAGCCTGAAAATGAGTAAAATAGTGAAATGAGGAATTTGAACATTTTATCTTTGGAT 

GGGGATCTTCTGAGGATGCAAAGAGTGATTCATCCAAGCCATGTGGTAAAATCAGGAATTTGA 

AGAAAATGGAGATGTTrACATTTTTGTTGACGTGTATTTTTCTACCCCTCCTAAGAGGGC ACAGT 

CTCITCACCTGTGAACCAATTACTGTTCCCAGATGTATGAAAATGGCCTACAACATGACGTTTTT 

CCCTAATCTGATGGGTCATTATGACCAGAGTATTGCCGCGGTGGAAATGGAGCATTTTCTTCCT 

CTCGCAAATCTGGAATGTTCACCAAACATTGAAACTTTCCTCTGCAAAGCATTTGTACCAACCT 

GCATAGAACAAATTCATGTGGTTCCACCrTGTCGTAAACTTTGTGAGAAAGTATATTCTGATTG 
CAAAAAATTAATTGACACOTITC 

TACTGTGATGAGACTGTTCCTGTAACTTTTGATCCACACACAGAATITCTTGGTCCrCAGAAGA 

AAACAGAACAAGTCCAAAGAGACATTGGATTTTGGTGTCCAAGGCATCTTAAGACTTCTGGGG 

GACAAGGATATAAGTTTCrGGGAATTGACCAGTGTGCGCCTCCATGCCCGAACATGTATTTTAA 

AAGTGATGAGCTAGAGTTTGCAAAAAG1TITATTGGAACAGTTTCAATATTTTG 

ACTCTGTTCACATTCCITACTTTTTTAATTGATGTTAGAAGATTCAGATACCCA 

TATATATTACTCTGTCTGTTACAGCATTGTATCTCTTATGTACTTCATTGGATTITTGCTGGGCGA 

TAGCACAGCCrGCAATAAGGCAGATGAGAAGCTAGAACtT-GGTGACACTGTTGTCCTAGGCTCT 

CAAAATAAGGCTTGCACCGTTTTGTTCATGCTTTTGTATTTm 

GTGGGTGATTCTTACCATTACTTGGTTCTTAGCTGCAGGAAGAAAATGGAGTTGTGAAGCCATC 
GAGCAAAAAGCAGTGTGGTTTCATGCTGTTGCATGGGGAACACCAGGTTTCCTGACTGTTATGC 
TTCrTGCTCTGAACAAAGTTGAAGGAGACAACATTAGTGGAGTTTGCTTTGTTGGCCTTTATGA 
CCTGGATGCTTCTCGCTACTITGTACTCTTGCCACTGTGCCTTTGTGTGTTT 

TCTTTTAGCTGGCATTATTTCCTTAAATCATGTTCGACAAGTCATACAACATGATGGCCGGAACC 

AAGAAAAACTAAAGAAATTTATGATTCGAATTGGAGTCrTCAGCGGCTTGTATCTTGTGCCATT 

AGTGACACTTCTCGGATGTTACGTCTATGAGCAAGTGAACAGGATTACCTGGGAGATAACTTGG 

GTCTCTGATCATTGTCGTCAGTACCATATCCCATGTCCTTATCAGGCAAAAGCAAAAGCTCGAC 

CAGAATTGGCTTTATTTATGATAAAATACCTGATGACATTAATTGTTGGCATCTCTGCTGTCTTC 

TGGGTTGGAAGCAAAAAGACATGCACAGAATGGGCTGGGTTTTITAAACGAAATCGCAAGAGA 

GATCCAATCAGTGAAAGTCGAAGAGTACTACAGGAATCATGTGAGTTTTTCTTAAAGCACAATT 

CTAAAGTTAAACACAAAAAGAAGCACTATAAACCAAGTTCACACAAGCTGAAGGTCATTTCCA 

AATCCATGGGAACCAGCACAGGAGCTACAGCAAATCATGGCACTTCTGCAGTAGCAATTACTA 

GCCATGATTACCTAGGACAAGAAACTTTGACAGAAATCCAAACCTCACCAGAAACATCAATGA 

GAGAGGTGAAAGCGGACGGAGCTAGCACCCCCAGGTTAAGAGAACAGGACTGTGGTGAACCT 

GCCTCGCCAGCAGCATCCATCTCCAGACTCTCTGGGGAACAGGTCGACGGGAAGGGCCAGGCA 

GGCAGTGTATCTGAAAGTGCGCGGAGTGAAGGAAGGATTAGTCCAAAGAGTGATATTACTGAC 

ACTGGCCTGGCACAGAGCAACAATTTGCAGGTCCCCAGTTCTTCAGAACCAAGCAGCCTCAAA 

GGTTCCACATCTCTGCTTGTTCACCCAGTTTCAGGAGTGAGAAAAGAGCAGGGAGGTGGTTGTC 

ATTCAGATACITGAAGAACATTTTCTCTCGTTACTCAGAAGCAAATTTGTGTTACACTGGAAGT 

GACCTATGCACTGTTTTGTAAGAATCACTGTTACGTTCTTCTTITGCACXrAAAGTTG 

TACTGTTATACTGGAAAAAATAGAGTTCAAGAATAATATGACT CATTTG ACACAAAGGTTAATG 

ACAACAATATAeCTGAAAACAGAAATGTGCAGGTTAATAATATTTTTTTAATAGTGTGGGAGGA 

CAGAGTTAGAGGAATCTTCCTTTTCTATTTA^ 

TGTACTATGCTATTTTACCTTTTTGATATA^ 

TATCClTGTATCTTTTTATACATATTTGAAAATAAGCTTAT 

TATTCAAGTATTTTTATCATGCTATT^ 

TAAGAAAATTGTAAAATAGTCTTCTTTTATACTGTAAAAAAAGATATACCAAAAAGT 
TAGGAATTTAACTTTAAAAACCCACTrATTGATACCTTACCATCrAAAATGTGT 
TCTCGTTITAGGAATTTCACAGATCTAAATTATGTAACTGAAATAAGGTGCTTACTCAAAGAGT 
GTCCACTATTGATTGTATTATGCTGCTCACTGATCCTTCTGC 

AGGGTTAGTAGACAAAATGTTAGTCTTTTGTATATTAGGCCAAGTGCAATTGACTTCCC'l 11 1 1 1 

AATGTTTCATGACCACCCATTGATTGTATTATAACCACTTACAGTTGCTTATAT^ 

CITITGTITCTTAACATTTAGAATATTACATTTTGTATTATACAGTA 

AG 



Figure 36 

CTCTCCCAACCGCCTCGTCGCACTCCTCAGGCTGAGAGCACCGCTGCACTCGCGGCCGGCGATG 

CGGGACCCCGGCGCGGCCGCTCCGCTTTCGTCCCTGGGCCTCTGTGCCCTGGTGCTGGCGCTGC 

TGGGCGCACTGTCCGCGGGCGCCGGGGCGCAGCCGTACCACGGAGAGAAGGGCATCTCCGTGC 

CGGACCACGGCTTCTGCCAGCCCATCTCCATCCCGCTGTGCACGGACATCGCCTACAACCAGAC 

CATCCTGCCCAACCTGCTGGGCCACACGAACCAAGAGGACGCGGGCCTCGAGGTGCACCAGTT 

CTACCCGCTGGTGAAGGTGCAGTGTTCTCCCGAACTCCGCTIT1TCITATGCTCCATGTATGCGC 

CCGTGTGCACCGTGCTCGATCAGGCCATCCCGCCGTGTCGTTCTCTGTGCGAGCGCGCCCGCCA 
GGGCTGCGAGGCGCTCATGAACAAGTTCGGCTTCCAGTGGCCCGAGCGGCTGCGCTGCGAGAA 

CTTCCCGGTGCACGGTGCGGGCGAGATCTGCGTGGGCCAGAACACGTCGGACGGCTCCGGGGG 

CCCAGGCGGCGGCCCCACTGCCTACCCTACCGCGCCCTACCTGCCGGACCTGCCCTTCACCGCG 

CTGCCCCCGGGGGCCTCAGATGGCAGGGGGCGTCCCGCCTTCCCCTTCTCATGCCCCCGTCAGC 

TCAAGGTGCCCCCGTACCTGGGCTACCGCTTCCTGGGTGAGCGCGATTGTGGCGCCCCGTGCGA 

ACCGGGCCGTGCCAACGGCCrGATGTACTTTAAGGAGGAGGAGAGGCGCTTCGCCCGCCTCTG 

GGTGGGCGTGTGGTCCGTGCTGTGCTGCGCCTCGACGCTCTTTAGCGTTCTCACCTACCTGGTGG 

ACATGCGGCGCTTCAGCTACCCAGAGCGGCCCATCATCTTCCTGTCGGGCTGCTACTTCATGGT 

GGCCGTGGCGCACGTGGCCGGCTTCCTTCTAGAGGACCGCGCCGTGTGCGTGGAGCGCTTCTCG 

GACGATGGCTACCGCACGGTGGCGCAGGGCACCAAGAAGGAGGGCTGCACCATCCTCTTCATG 

GTGCTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTCATTGTGTCTCTCACTTGGTTCCT 

GGCGGCCGGCATGAAGTGGGGCCACGAGGCCATCGAGGCCAACTCGCAGTACTTCCACCTGGC 

CGCGTGGGCCGTGCCCGCCGTCAAGACCATCACTATCCTGGCCATGGGCCAGGTAGACGGGGA 

CCTGCTGAGCGGGGTGTGCTACGTTGGCCTCTCCAGTGTGGACGCGCTGCGGGGCTTCGTGCTG 

GCGCCTCTGTTCGTCTACCTCTrCATAGGCACGTCCTTCTTGCTGGCCGGCTTCGTGTCCCTCTTC 

CGTATCCGCACCATCATGAAACACGACGGCACCAAGACCGAGAAGCTGGAGAAGCTCATGGTG 

CGCATCGGCGTCTTCAGCGTGCTCTACACAGTGCCCGCCACCATCGTCCTGGCCTGCTACTTCTA 

CGAGCAGGCCTTCCGCGAGCACTGGGAGCGCACCTGGCTCCTGCAGACGTGCAAGAGCTATGC 

CGTGCCCTGCCCGCCCGGCCACTTCCCGCCCATGAGCCCCGACTTCACCGTCTTCATGATCAAG 

TACCTGATGACCATGATCGTCGGCATCACCACTGGCTTCTGGATCTGGTCGGGGAAGACCCTGC 

AGTCGTGGCGCCGCTTCTACCACAGACTTAGCCACAGCAGCAAGGGGGAGACTGCGGTATGAG 

CCCCGGCCCCTCCCCACCTTTCCCACCCCAGCCCTCTTGCAAGAGGAGAGGCACGGTAGGGAAA 

AGAACTGCTGGGTGGGGGCCTGTTTCTGTAACTTTCrCCCCCTCTACTGAGAAGTGACCTGGAA 

GTGAGAAGTTCITTGCAGATTTGGGGCGAGGGGTGATTTGGAAAAGAAGACCTGGGTGGAAAG 

CGGTTTGGATGAAAAGATTTCAGGCAAAGACTTGCAGGAAGATGATGATAACGGCGATGTGAA 

TCGTCAAAGGTACGGGCCAGCTTGTGCCTAATAGAAGGTTGAGACCAGCAGAGACTGCTGTGA 

GTTTCTCCCGGCTCCGAGGCTGAACGGGGACTGTGAGCGATCCCCCTGCTGCAGGGCGAGTGGC 

CTGTCCAGACCCCrGTGAGGCCCCGGGAAAGGTACAGCCCrGTCTGGGGTGGCTGCTTTGTTGG 

AAAGAGGGAGGGCCTCCrGCGGTGTGCtTGTCAAGCAGTGGTCAAACCATAATCrCITTTCACT 

GGGGCCAAACTGGAGCCCAGATGGGTTAATTTCCAGGGTCAGACATTACGGTCTCTCCTCCCCT 

GCCCCCTCCCGCCTGTTTTTCCTCCCGTACrGCTTTCAGGTCTTGTAAAATAAGCATT^ 

CTTGGGAGGCCTGCCTGCTAGAATGCTAATGTGAGGATGCAAAAGAAATGATGATAACATTTTG 

AGATAAGGCCAAGGAGACGTGGAGTAGGTATTTTTGCTACITmCATTTTCTGGGGAAGGCAG 

GAGGCAGAAAGACGGGTGTTTTATTTGGTCrAATACCCrGAAAAGAAGTGATGACrrGTTGCTT 

TTCAAAACAGGAATGCATTTTTCCCCITGTCTITGTTG 

GTCTCCCTGTGGAAAGGCATAACTGTGACGAAAGCAACITTTATAGGCAAAGCAGCGCAAATC 
TGAGGTTTCCCGTTGGTTGTTAATTTGGTTGAGATAA 

GCAGTGTGCTGTCACACACCGTTAAGCCAGAGGTTCTGACTTCGCTAAAGGAAATGTAAGAGG 

TTTTGTTGTCTGTTTTAAATAAATTTAATTCGGAACACATGATCCAACAGACT 

TCAGGGAAATCTCTCCCTTCATTTACTTITTCTTGCTATAAGCCrATATTT 

TTTirrcrCCCATTTGGATCCTTTGAGGTAAAAAAACATAATGTCITCAGCCrC^ 

AAGTTAATTAAAAAAAAAAAGCAAAGAGCCATTTTGTCCT 

TTATTAAACATCATCCATATGCTGACCCTGTCTCTGTGTGGTTGGGTTGGGAGGCGATCAGCAG 
ATACCATAGTGAACGAAGAGGAAGGTTTGAACCATGGGCCCCATCTTTAAAGAAAGTCATTAA 
AAGAAGGTAAACTTCAAAGTGATTCTGGAGTTCTTTGAAATGTGCTGGAAGACITAAATTTATr 
AATCTTAAATCATGTACTTTTTTTCTC 

TTAGCAGAGAATCATGGGAGCTAACCTTTATCCCACCTTTGACACTACCCTCCAATCTTGCAAC 
ACTATCCTGTTTCTCAGAACAGTTTTTAAATGCCAATCATAGAGGGTACTGTAAAGTGTACAAG 
TTACTTTATATATGTAATGTTCACTTGAGTGGAACTGCTTTTTACATTA 
TGTGTTTCTTCAACCTTCAAAACTATCTCATCT 

GGATCirrrGTGCTGTATCTTTTAAGTGCATGTGAAATTTGTAAAATAGAGATAAGTACAGTAT 

GTATATTTTGTAAATCTCCCATTTTTGT^ 

ATTTTTGTTTTGTTGGCTTTAAAGGTCT 

TTTTTAAATAC 
ACAGCATGGAGTGGGGTTACCTGTTGGAAGTGACCTCGCTGCTGGCCGCCTTGGCGCTGCTGCA
GCGCTCTAGCGGCGCTGCGGCCGCCTCGGCCAAGGAGCTGGCATGCCAAGAGATCACCGTGCC 
GCTGTGTAAGGGCATCGGCTACAACTACACCTACATGCCCAATCAGTTCAACCACGACACGCA 
AGACGAGGCGGGCCTGGAGGTGCACCAGTTCTGGCCGCTGGTGGAGATCCAGTGCTCGCCCGA 

tctcaagttcttcctgtgcagcatgtacacgcgcatctgcctagAggactacaagaagccgctg 

CCGCCCTGCCGCTCGGTGTGCGAGCGCGCCAAGGCCGGCTGCGCGCCGCTCATGCGCCAGTAC 

GGCTTCGCCTGGCCCGACCGCATGCGCTGCGACCGGCTGCCCGAGCAAGGCAACCCTGACACG 

CTGTGCATGGACTACAACCGCACCGACCTAACCACCGCCGCGCCCAGCCCGCCGCGCCGCCTGC 

CGCCGCCGCCGCCCGGCGAGCAGCCGCCTTCGGGCAGCGGGCACGGCCGCCCGCCGGGGGCCA 

GGCCCCCGCACCGCGGAGGCGGCAGGGGCGGTGGCGGCGGGGACGCGGCGGCGCCCCCAGCT 

CGCGGCGGCGGCGGTGGCGGGAAGGCGCGGCCCCCTGGCGGCGGCGCGGCTCCCTGCGAGCCC 

GGGTGCCAGTGCCGCGCGCCTATGGTGAGCGTGTCCAGCGAGCGCCAGCCGCTCTACAACCGC 

GTCAAGACAGGCCAGATCGCTAACTGCGCGCTGCCCrGCCACAACCCCTTTTTCAGCCAGGACG 

AGCGGGCCTTCACCGTCTTCTGGATCGGCCTGTGGTCGGTGCTCTGCTTCGTGTCCACCTTCGCC 

ACCGTCrCCACCTTCCTTATCGACATGGAGCGCTTCAAGTACCCGGAGCGGCCCATTATCTTCCT 

CTCGGCCTGCTACCTCTTCGTGTCGGTGGGCTACCTAGTGCGCCTGGTGGCGGGCCACGAGAAG 

GTGGCGTGCAGCGGTGGCGCGCCGGGCGCGGGGGGCGCTGGGGGCGCGGGCGGCGCGGCGGC 

GGGCGCGGGCGCGGCGGGCGCGGGCGCGGGCGGCCCGGGCGGGCGCGGCGAGTACGAGGAGC 

TGGGCGCGGTGGAGCAGCACGTGCGeTACGAGACCACCGGCCCCGCGCTGTGCACCGTGGTCT 

TCTTGGTGGTCTACTTCTTCGGCATGGCCAGCTCCATCTGGTGGGTGATCTTGTCGCTCACATGG 

TTCCTGGGGGCCGGTATGAAGTGGGGCAACGAAGCCATCGCCGGCTACTCGCAGTACTTCCACC 

TGGCCGCGTGGCTTGTGCCCAGCGTCAAGTCCATCGCGGTGCTGGCGGTCAGCTCGGTGGACGG 

CGACCCGGTGGCGGGCATCTGCTACGTGGGCAACCAGAGCCTGGACAACGTGCGCGGCTTCGT 

GCTGGCGCCGCTGGTCATCTACCTCTTCATCGGCACCATGTTCCTGCTGGCCGGCTTCGTGTCCC 

TGTTCCGCATCCGCTCGGTCATCAAGCAACAGGACGGCCGCACCAAGACGCACAAGCTGGAGA 

AGCTGATGATCCGCCTGGGGCTGTTCACCGTGGTCTACACCGTGCCCGCCGCGGTGGTGGTCGC 

CTGCCTCTTCTACGAGCAGCACAACCGCCCGCGCTGGGAGGCCACGCACAACTGCGCGTGCCTG 

CGGGACCTGCAGCGCGACCAGGCACGCAGGCCCGACTACGCCGTCTTCATGCTCAAGTACTTCA 

TGTGeCTAGTGGTGGGCATCACCTCGGGCGTGTGGGTCTGGTCCGGCAAGACGCTGGAGTCCTG 

GCGCTCCCTGTGCACCCGCTGCTGCTGGGCCAGCAAGGGCGCCGCGGTGGGGGGGGGCGCGGG 

CGCCACGGCCGCGGGGGGTGGCGGCGGGCCGGGGGGCGGCGGCGGCGGGGGACCCGGCGGCG 

GCGGGGGGCCGGGCGGCGGCGGGGGCTCCCTCTACAGCGACGTCAGCACTGGCCTGACGTGGC 

GGTCGGGCACGGCGAGCTCCGTGTCITATCCAAAGCAGATGCCATTGTCCCAGGTCTGAGCGGA 

GGGGAGGGGGCGCGCAGGAGGGGTGGGGAGGGGGGCGAGGAGACCCAAGTGCAGCGAAGGG 

ACACITGATGGGCrGAGGTTCCCACCCCTTCACAGTGTTGATTGCTATTAGCATGATAATGAAC 

tcitaatggtatccattagcrgggacttaaatgacrcaotagaacaaagtaccrggcattgaa 

gcctcccagacccaggcccrrrtcctccattgatgtgcggggagctcctcccgccaggcgtraat 

ttctgttggcrgaggagggtggactctgcggcgtttccagaacccgagatttggagccctccct 

ggctgcacttggctgggtttgcagtcagatacacagatttcacctgggagaacctcil 1 1 lctcc 

ctcgActgttcctacgtaaactcccacccctgacttaccctggaggaggggtgaccgccacctg 

atgggattgcacggtttgggtattcitaatgaccaggcaaatgccrtaagtaaacaaacaaga 

aatgtcitaattatacaccccacgtaaatacgggtttcttacattagaggatgtatttatataat 

tatrrgttaaattgtaaaaaaaaaaagtgtaaaatatgtatatatccaaagatatagtgtgtac 

atttttttgtaaaaagtttagaggcitacccctgtaagaacagata^ 

ATAAAATGACITTTGATAAATGATTTAACCATTGCCCrcrCCCCCGCCrCITGr 

TITAAAGTGCITGCrAAGGACGCATGGGGAAAATGGACATTTTGrGGCnTGTCATTCTGTACAC 

TGACCTTAGGCATGGAGAAAATTACTTGTTAAACTCTAGTTCTTAAGTTGTTAGCCAAGTAAAT 

ATCATTGTTGAACTGAAATCAAAATTGAGTTTTTGCACCITCCCCAAAGACGGTGTTm 

GAGCTCTTTTCTGATCCATGGATAACAACTCTCACITTAGTGGATGTAAATGGAACITCTGCAA 

GGCAGTAATTCCCCTTAGGCCTTGTrATTTATCCTGCATGGTATCACTAAAGGTTTCAAAACCCT 

GAAAAAAAA 
CCGCCITCGGCCCGGGCCTCCCGGGATGGCCGTGGCGCCTCTGCGGGGGGCGCTGCTGCTGTGG

CAGCTGCTGGCGGCGGGCGGCGCGGCACTGGAGATCGGCCGCTTCGACCCGGAGCGCGGGCGC 

GGGGCTGCGCCGTGCCAGGCGGTGGAGATCCCCATGTGCCGCGGCATCGGCTACAACCTGACC 

CGCATGCCCAACCTGCTGGGCCACACGTCGCAGGGCGAGGCGGCTGCCGAGCTAGCGGAGTTC 

GCGCCGCTGGTGCAGTACGGCTGCCACAGCCACCTGCGCin'CTTCCTGTGCTCGCTCTACGCGC 

CCATGTGCACCGACCAGGTCTCGACGCCCATTCCGGCCTGCCGGCCCATGTGCGAGCAGGCGCG 

CCTGCGCTGCGCGCCCATCATGGAGCAGTTCAACTTCGGCTGGCCGGACTCGCTCGACTGCGCC 

CGGCTGCCCACGCGCAACGACCCGCACGCGCTGTGCATGGAGGCGCCCGAGAACGCCACGGCC 

GGCCCCGCGGAGCCCCACAAGGGCCTGGGCATGCTGCCCGTGGCGCCGCGGCCCGCGCGCCCT 

CCCGGAGACCTGGGCCCGGGCGCGGGCGGCAGTGGCACCTGCGAGAACCCCGAGAAGTTCCAG 

TACGTGGAGAAGAGCCGCTCGTGCGCACCGCGCTGCGGGCCCGGCGTCGAGGTGTTCTGGTCC 

CGGCGCGACAAGGACTTCGCGCTGGTCTGGATGGCCGTGTGGTCGGCGCTGTGCTTCTTCTCCA 

CCGCCTTCACTGTGCTCACCTTCTTGCTGGAGCCCCACCGCTTCCAGTACCCCGAGCGCCCCATC 

ATCTTCCTCTCCATGTGCTACAACGTCTACTCGCTGGCCTTCCTGATCCGTGCGGTGGCCGGAGC 

GCAGAGCGTGGCCTGTGACCAGGAGGCGGGCGCGCTCTACGTGATCCAGGAGGGCCTGGAGAA 

CACGGGCTGCACGCTGGTCTTCCTACTGCrCTACTACTTCGGCATGGCCAGCTCGCTCTGGTGG 

GTGGTCCTGACGCTCACCTGGTTCCTGGCTGCCGGGAAGAAATGGGGCCACGAGGCCATCGAG 

GCCCACGGCAGCTATTTCCACATGGCTGCCTGGGGCCTGCCCGCGCTCAAGACCATCGTCATCC 

TGACCCTGCGCAAGGTGGCGGGTGATGAGCTGACTGGGCTTTGCTACGTGGCCAGCACGGATG 

CAGCAGCGCTCACGGGCTTCGTGCTGGTGCCCCTCTCTGGCTACCTGGTGCTGGGCAGTAGTTT 

CCTCCTGACCGGCTTCGTGGCCCTCTTCCACATCCGCAAGATCATGAAGACGGGCGGCACCAAC 

ACAGAGAAGCTGGAGAAGCTCATGGTCAAGATCGGGGTCTTCTCCATCCTCTACACGGTGCCCG

CCACCTGCGTCATCGTTTGCTATGTCTACGAACGCCTCAACATGGACrTCTGGCGCCTTCGGGCC 

ACAGAGCAGCCATGCGCAGCGGCCGCGGGGCCCGGAGGCCGGAGGGACTGCTCGCTGCCAGG 

GGGCTCGGTGCCCACCGTGGCGGTCTTCATGCTCAAAATTTTCATGTCACTGGTGGTGGGGATC 

ACCAGCGGCGTCTGGGTGTGGAGCTCCAAGACTTTCCAGACCTGGCAGAGCCTGTGCTACCGCA 

AGATAGCAGCTGGCCGGGCCCGGGCCAAGGCCTGCCGCGCCCCCGGGAGCTACGGACGTGGCA 

CGCACTGCCACTATAAGGCTCCCACCGTGGTCTTGCACATGACTAAGACGGACCCCTCTTTGGA 

GAACCCCACACACCTCTAGCCACACAGGCCTGGCGCGGGGTGGCTGCTGCCCCCTCCTTGCCCT 

CCACGCCCTGCCCCCTGCATCCCCTAGAGACAGCTGACTAGCAGCTGCCCAGCTGTCAAGGTCA 

GGCAAGTGAGCACCGGGGACTGAGGATCAGGGCGGGACCCCGTGAGGCTCATTAGGGGAGAT 

GGGGGTCTCCCCTAATGCGGGGGCTGGACGAGGCTGAGTCCCCACAGGGTCCTAGTGGAGGAT 

GTGGAGGGGCGGGGCAGAG<3GGTCCAGCCGGAGTTTATTTAATGATGTAATTTATTGTTGCGTT 

CCTCrGGAAGCTGTGACTGGAATAAACCCCCGCGTGGCACTGCTGATCCTCTCTGGCTGGGAAG 

GGGGAAGGTAGGAGGTGAGGC 



Figure 39 

ACACGTCCAACGCCAGCATGCAGCGCCCGGGCCCCCGCCTGTGGCTGGTCCTGCAGGTGATGG 

GCTCGTGCGCCGCCATCAGCTCCATGGACATGGAGCGCCCGGGCGACGGCAAATGCCAGCCCA 

TCGAGATCCCGATGTGCAAGGACATCGGCTACAACATGACTCGTATGCCCAACCTGATGGGCC 

ACGAGAACCAGCGCGAGGCAGCCATCCAGTTGCACGAGTTCGCGCCGCTGGTGGAGTACGGCT 

GCCACGGCCACCTCCGCTTCTTCCTGTGCTCGCTGTACGCGCCGATGTGCACCGAGCAGGTCTC 

TACCCCCATCCCCGCCTGCCGGGTCATGTGCGAGCAGGCCCGGCTCAAGTGCTCCCCGATTATG 

GAGCAGTTCAACTTCAAGTGGCCCGACTCCCTGGACTGCCGGAAACTCCCCAACAAGAACGAC 

CCCAACTACCTGTGCATGGAGGCGCCCAACAACGGCTCGGACGAGCCCACCCGGGGCTCGGGC 

CTGTTCCCGCCGCTGTTCCGGCCGCAGCGGCCCCACAGCGCGCAGGAGCACCCGCTGAAGGAC 

GGGGGCCCCGGGCGCGGCGGCTGCGACAACCCGGGCAAGTTCCACCACGTGGAGAAGAGCGC 

GTCGTGCGCGCCGCTCTGCACGCCCGGCGTGGACGTGTACTGGAGCCGCGAGGACAAGCGCTT 

CGCAGTGGTCTGGCTGGCCATCTGGGCGGTGCTGTGCTTCTTCTCCAGCGCCTTCACCGTGCTCA 

CCTTCCTCATCGACCCGGCCCGCTTCCGCTACCCCGAGCGCCCCATCATCTTCCTCTCCATGTGC 

TACTGCGTCTACTCCGTGGGCTACCTCATCCGCCTCTTCGCCGGCGCCGAGAGCATCGCCTGCG 

ACCGGGACAGCGGCCAGCTCTATGTCATCCAGGAGGGACTGGAGAGCACCGGCTGCACGCTGG 

TCTTCCTGGTCCTCTACTACTTCGGCATGGCCAGCTCGCTGTGGTGGGTGGTCCTCACGCTCACC 
TGGTTCCTGGCCGCCGGCAAGAAGTGGGGCCACGAGGCCATCGAAGCCAACAGGAGCTACTTC

CACCTGGCAGCCTGGGCCATCCCGGCGGTGAAGACCATCCTGATCCTGGTCATGCGCAGGGTG 

GCGGGGGACGAGCTCACCGGGGTCTGCTACGTGGGCAGCATGGACGTCAACGCGCTCACCGGC 

TTCGTGCTCATTCCCCTGGCCTGCTACCTGGTCATCGGCACGTCCTTCATCCTCTCGGGCTTCGT 

GGCCCTGTTCCACATCCGGAGGGTGATGAAGACGGGCGGCGAGAACACGGACAAGCTGGAGA 

AGCTCATGGTGCGTATCGGGCTCTTCTCTGTGCTGTACACCGTGCCGGCCACCTGTGTGATCGCC 

TGCTACTTTTACGAACGCCTCAACATGGATTACTGGAAGATCCTGGCGGCGCAGCACAAGTGCA 

AAATGAACAACCAGACTAAAACGCTGGACTGCCTGATGGCCGCCTCCATCCCCGCCGTGGAGA 

TCTTCATGGTGAAGATCTTTATGCTGCTGGTGGTGGGGATCACCAGCGGGATGTGGATTTGGAC 

CTCCAAGACTCTGCAGTCCTGGCAGCAGGTGTGCAGCCGTAGGTTAAAGAAGAAGAGCCGGAG 

AAAACCGGCCAGCGTGATCACCAGCGGTGGGATTTACAAAAAAGCCCAGCATCCCCAGAAAAC 

TCACCACGGGAAATATGAGATCCCTGCCCAGTCGCCCACCTGCGTGTGAACAGGGCTGGAGGG 

AAGGGCACAGGGGCGCCCGGAGCTAAGATGTGGTGCTITrCTTGGTrGTGTTITrCTTTOT 

CrTTCTTTTTTI 11111 ITATAAAAGCAAAAGAGAAATACATAAAAAAGTGTTTACCCTGAAATTC 

AGGATGCTGTGATACACTGAAAGGAAAAATGTACTTAAAGG^ 

AGCGAAGGGAAGCTCCTCCAGTGAAGTAGCCTCITGTGTAACTAATTTGTGGTAAAGTAGTTGA 

rrCAGCCCTCAGAAGAAAACTTTTGTTTAGAGCCCTCCGTAAATATACATCTGTGTATTTGAGTT 

GGCrTrGCrACCCATTrACAAATAAGAGGACAGATAACTGCTTTGCAAATTCAAGAGCCTCCCC 

TGGGTTAACAAATGAGCCATCCCCAGGGCCCACCCCCAGGAAGGCCACAGTGCTGGGCGGCAT 

CCCTGCAGAGGAAAGACAGGACCCGGGGCCCGCCTCACACCCCAGTGGATTTGGAGTTGCTTA 

AAATAGACTCTGGCCTTCACCAATAGTCTCTCTGCAAGACAGAAACCTCCATCAAACCTCACAT 

ITGTGAACTCAAACGATGTGCAATACATTTTTTTCTCTTTCCTTGAAAATAAA^ 

GTATTTTGCTATATATAAAGACAACAAAAGAAATCTCCTAACAAAAGAACTAAGAGGCCCAGC 

CCTCAGAAACCCTTCAGTGCTACATTTTGTGGCTITITAATGGAAACCAAGCCAATGTTATAGA 

CGTTTGGACTGATTTGTGGAAAGGAGGGGGGAAGAGGGAGAAGGATCATTCAAAAGTTACCCA 

AAGGGCTTATTGACTCTTTCTATTGTTAAACAAATGATTTCCACAAACAGATCAGGAAGCACTA 

GGTTGGCAGAGACACITrGTCTAGTGTATTCTCTTCACAGTGCCAGGAAAGAGTGGTTTCTGCG 

TGTGTATATTTGTAATATATGATATTTTTCATGCTCCACTATTTTATTAAAAA^ 

TTAAAAAAA 
Figure 40 

CCTGCAGCCTCCGGAGTCAGTGCCGCGCGCCCGCCGCCCCGCGCCTTCCTGCTCGCCGCACCTC 

CGGGAGCCGGGGCGCACCCAGCCCGCAGCGCCGCCTCGGCGCCCGCGCCGCCTCCGACCGCAG 

GCCGAGGGCCGCCACTGGCCGGGGGGACCGGGCAGCAGCTTGCGGCCGCGGAGCCGGGCAAC 

GCTGGGGACTGCGCCrTTTGTCCCCGGAGGTCCCTGGAAGTfTGCGGCAGGACGCGCGCGGGG 

AGGCGGCGGAGGCAGCCCCGACGTCGCGGAGAACAGGGCGCAGAGCCGGCATGGGCATCGGG 

CGCAGCGAGGGGGGCCGCCGCGGGGCCCTGGGCGTGCTGCTGGCGCTGGGCGCGGCGCTTCTG 

GCCGTGGGCTCGGCCAGCGAGTACGACTACGTGAGCTTCCAGTCGGACATCGGCCCGTACCAG 

AGCGGGCGCTTCTACACCAAGCCACCTCAGTGCGTGGACATCCCCGCGGACCTGCGGCTGTGCC 

ACAACGTGGGCTACAAGAAGATGGTGCTGCCCAACCTGCTGGAGCACGAGACCATGGCGGAGG 

TGAAGCAGCAGGCCAGCAGCTGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGGACCCAGG 

TCTTCCTCTGCTCGCTCTTCGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCTC 

TGCGAGGCCGTGCGCGACTCGTGCGAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGGCCGAGA 

TGCTTAAGTGTGACAAGTTCCCGGAGGGGGACGTCTGCATCGCCATGACGCCGCCCAATGCCAC 

CGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCCTCCCTGTGACAACGAGTTGAAATCTGA 

GGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGCACTGAGGATGAAAATAAAAGAAGTGAA 

AAAAGAAAATGGCGACAAGAAGATTGTCCCCAAGAAGAAGAAGCCCCTGAAGTTGGGGCCCA 

TCAAGAAGAAGGACCTGAAGAAGCTTGTGCTGTACCTGAAGAATGGGGCTGACTGTCCCTGCC 

ACCAGCTGGACAACCTCAGCCACCACTTCCTCATCATGGGCCGCAAGGTGAAGAGCCAGTACTT 

GCTGACGGCCATCCACAAGTGGGACAAGAAAAACAAGGAGTTCAAAAACTTCATGAAGAAAA 

TGAAAAACCATGAGTGCCCCACCTTTCAGTCCGTGTTTAAGTGATTCTCCCGGGGGCAGGGTGG 

GGAGGGAGCCTCGGGTGGGGTGGGAGCGGGGGGGACAGTGCCCGGGAACCCGTGGTCACACA 

CACGCACTGCCCTGTCAGTAGTGGACATTGTAATCCAGTCGGCTTGTTCTTGCAGCATTCCC 
CCCTTTCCCTCCATAGCCACGCTCCAAACCCCAGGGTAGCCATGGCCGGGTAAAGCAAGGGCC

ATTTAGATTAGGAAGGTTTTTAAGATCCGCAATGTGGAGCAGCAGCCACTGCACAGGAGGAGG 

TGACAAACCATTTCCAACAGCAACACAGCCACTAAAACACAAAAAGGGGGATTGGGCGGAAA 

GTGAGAGCCAGCAGCAAAAACTACATTTTGCAACTTGTTGGTGTGGATCTATTGGCTGATCTAT 

GCCTTTCAACTAGAAAATTCTAATGATTGGCAAGTCACGTTC 

TTCTGTCTGCTTTAAATGGAAACAGACTCATACCACACITACAATTAAGGTCA^GCCCAGAAAG 

TGATAAGTGCAGGGAGGAAAAGTGCAAGTCCATTATCTAATAGTGACAGCAAAGGGACCAGGG 

GAGAGGCATTGCCTTCTCTGCCCACAGTCTTTCCGTGTGATTGTCTTTGAATCTGAATCAGCCAG 

TCTCAGATGCCCCAAAGTTTCGGTTCCTATGAGCCCGGGGCATGATCTGATCCCCAAGACATGT 

GGAGGGGCAGCCTGTGCCTGCCTTTGTGTCAGAAAAAGGAAACCACAGTGAGCCTGAGAGAGA 

CGGCGATTTTCGGGCTGAGAAGGCAGTAGTTTTCAAAACACATAGTTA 



Figure 41 

GAATTCGTTCAGCCTGGTTAAGTCCAAGCTGGCTCATTCTGCTCCCCCGGGTCGGAGCCCCCCG 

GAGCTGCGCGCGGGCITGCAGCGCCTCGCCCGCGCTGTCCTCCCGGTGTCCCGCTTCTCCGCGC 

CCCAGCCGCCGGCTGCCAGCTTTTCGGGGCCCCGAGTCGCACCCAGCGAAGAGAGCGGGCCCG 

GGACAAGCTCGAACTCCGGCCGCCTCGCCCTTAACCAGGTCCGTCCCTCTACCCCCTAGGGGTC 

GCGCCCACGATGCTGCAGGGCCCTGGCTCGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCT 

GGGCTCGGCGCGCGGGCTCTTCCTCTTrGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGC 

AAGCCCATCCCGGCCAACCTGCAGCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCC 

AACCTGCTGGGCCACGAGACCATGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTG

GTCATGAAGCAGTGCCACCCGGACACCAAGAAGTTCCTGTGCrCGCTCTTCGCCCCCGTCTGCC 

TCGATGACCTAGACGAGACCATCCAGCCATGCCACTCTCGNTGCGTGCAGGTGAAGGATCGCT 

GCGCCCCGGTCATGTCCGCCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGA 

CAACGACCTTTGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCA 

AAGGTATGTGAAGCCTGGAAAAATAAAAATGATGATGACAACGACATAATGGAAACGCTTTGT 

AAA\ATGATTTTGCACTGAAAATAAAAGTGAAGGAGATAACGTACATCAACCGT 



Figure 42 

CCGGGTCGGAGCCCCCCGGAGCTGCGCGCGGGCTTGCAGCGCCTCGCCCGCGCTGTCCTCCCGGTGTCCC 
GCTTCTCCGCGCCCCAGCCGCCGGCTGCCAGCTTTTCGGGGCCCCGAGTCGCACCCAGCGAAGAGAGCGG 
GCCCGGGACAAGCTCGAACTCCGGCCGCCTCGCCCTTCCCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTC 
GCGCGCCCACGATGCTGCAGGGCCCTGGCTCGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGG 
CTCGGCGCGCGGGCTCTTCCTCTTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATC 
CCTGCCAACCTGCAGCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCGCAACCTGCTGGGCCACG
AGACCATGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCGGA 
CACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGACCATCCAGCCA 
TGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCGCCTTCGGCTTCCCCTGGC 
CCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTTTGCATCCCCCTCGCTAGCAGCGACCA 
CCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATGTGAAGCCTGCAAAAATAAAAATGATGATGACAAC 
GACATAATGGAAACGCTTTGTAAAAATGATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCA 
ACCGAGATACCAAAATCATCCTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAG 
GGACCTGAAGAAATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATC 
AACGCGCCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGGTGGC 
AGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCTAGTCCCGGCATCC 
TGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCCGGGATCTCAGCTCCCGTTCC 
CCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCTTCCCCCTGCCTTTTGCACGTTTGCATCC
CCAGCATTTCCTGAGTTATAAGGCCACAGGAGTGGATAGCTGTTTTCACCTAAAGGAAAAGCCCACCCGA 
ATCTTGTAGAAATATTCAAACTAATAAAATCATGAATATTTTTATGAAGTTT 
ACGGGGCCTGGGCGGSAGGGGCGGTGGCTGGAGCTCGGTAAAGCTCGTGGGACCCCATTGGGG

GAATTTGATCCAAGGAAGCGGTGATTGCCGGGGGAGGAGAAGCTCCCAGATCCTTGTGTCCAC 

TTGCAGCGGGGGAGGCGGAGACGCGGAGCGGGCCTTTTGGCGTCCACTGCGCGGCTGCACCCT 

GCCCCATCCTGCCGGGATCATGGTCTGCGGCAGCCCGGGAGGGATGCTGCTGCTGCGGGCCGG 

GCTGCTTGCCCTGGCTGCTCTCTGCCTGCTCCGGGTGCCCGGGGCTCGGGCTGCAGCCTGTGAG 

CCCGTCCGCATCCCCCTGTGCAAGTCCCTGCCCTGGAACATGACTAAGATGCCCAACCACCTGC 

ACCACAGCACTCAGGCCAACGCCATCCTGGCCATCGAGCAGTTCGAAGGTCTGCTGGGCACCC 

ACTGCAGCCCCGATCTGCTCTTCTTCCTCTGTGCCATGTACGCGCCCATCTGCACCATTGACTTC 

CAGCACGAGCCCATCAACCCCTGTAAGTCTGTGTGCGAGCGGGCCCGGCAGGGCTGTGAGCCC 

ATACTCATCAAGTACCGCCACTCGTGGCCGGAGAACCTGGCCTGCGAGGAGCTGCCAGTGTAC

GACAGGGGCGTGTGCATCTCTCCCGAGGCCATCGTTACTGCGGACGGAGCTGATTTTCCTATGG 

ATTCTAGTAACGGAAACTGTAGAGGGGCAAGCAGTGAACGCTGTAAATGTAAGCCTATTAGAG 

CTACACAGAAGACCTATTTCCGGAACAATTACAACTATGTCATTCGGGCTAAAGTTAAAGAGAT 

AAAGACTAAGTGCCATGATGTGACTGCAGTAGTGGAGGTGAAGGAGATTCTAAAGTCCTCTCT 

GGTAAACATTCCACGGGACACTGTCAACCTCTATACCAGCTCTGGCTGCCTCTGCCCTCCACTT 

AATGTTAATGAGGAATATATCATCATGGGCTATGAAGATGAGGAACGTTCCAGATTACTCTTGG 

TGGAAGGCTCTATAGCTGAGAAGTGGAAGGATCGACTCGGTAAAAAAGTTAAGCGCTGGGATA 

TGAAGCTTCGTCATCTTGGACTCAGTAAAAGTGATTCTAGCAATAGTGATTCCACTCAGAGTCA 

GAAGTCTGGCAGGAACTCGAACCCCCGGCAAGCACGCAACTAAATCCCGAAATACAAAAAGTA 

ACACAGTGGACTTCCTATTAAGACTTACTTGCATTGCTGGACTAGCAAAGGAAAATTGCACTAT 

TGCACATCATATTCTATTGTTTACTATAAAAATCATGTGA 

TTTGGTTTCTGCTTCTCTCTTCTCTCAACCCCTTTGTAATGGTTTGGGGGCAG 

TTGTGAGTTTTCTATTTCACTAATCATGAGAAAAACTGTTCTTTTGCAAT 

ATGCTGTTA 

Figure 44 

CAGCGGCCGCTGAATTCTAGGGCGGGTTCGCGCCCCGAAGGCTGAGAGCTGGCGCTGCTCGTG 
CCCTGTGTGCCAGACGGCGGAGCTCCGCGGCCGGACCCCGCGGCCCCGCTTTGGTGCCGACTGG 
AGTTTGGGGGAAGAAACTCrCCTGCGCCCCAGAAGATTTCTTCCTCGGCGAAGGGACAGCGAA 
AGATGAGGGTGGCAGGAAGAGAAGGCGCTTTCTGTCTGCCGGGGTCGCAGCGCGAGAGGGCA
GTGCCATGTTGCTCTCCATCCTAGTGGCGCTGTGCCTGTGGCTGCACCTGGCGCTGGGCGTGCG 
CGGCGCGCCCTGCGAGGCGGTGCGCATCCCTATGTGCCGGCACATGCCCTGGAACATCACGCG 
GATGCCCAACCACCTGCACCACAGCACGCAGGAGAACGCCATCCTGGCCATCGAGCAGTACGA 
GGAGCTGGTGGACGTGAACTGCAGCGCCGTGCTGCGCT^CTTCTTCTGTGCCATGTACGCGCCC 
ATTTGCACCCTGGAGTTCCTGCACGACCCTATCAAGCCGTGCAAGTCGGTGTGCCAACGCGCGC 
GCGACGACTGCGAGCCCCTCATGAAGATGTACAACCACAGCTGGCCCGAAAGCCTGGCCTGCG 
ACGAGCTGCCTGTCTATGACCGTGGCGTGTGCATTTCGCCTGAAGCCATCGTCACGGACCTCCC 
GGAGGATGTTAAGTGGATAGACATCACACCAGACATGATGGTACAGGAAAGGCCTCTTGATGT 
TGACTGTAAACGCCTAAGCCCCGATCGGTGCAAGTGTAAAAAGGTGAAGCCAACTTTGGCAAC 
GTATCTCAGCAAAAACTACAGCTATGTTATTCATGCCAAAATAAAAGCTGTGCAGAGGAGTGG 
CTGCAATGAGGTCACAACGGTGGTGGATGTAAAAGAGATCTTCAAGTCCTCATCACCCATCCCT 
CGAACTCAAGTCCCGCTCATTACAAATTCTTCTTGCCAGTGTCCACACATCCTGCCCCATCAAG 
ATGTTCTCATCATGTGTTACGAGTGGCGTTCAAGGATGATGCTTCTTGAAAATTGCTTAGTTGAA 
AAATGGAGAGATCAGCTTAGTAAAAGATCCATACAGTGGGAAGAGAGGCTGCAGGAACAGCG 
GAGAACAGTTCAGGACAAGAAGAAAACAGCCGGGCGCACCAGTCGTAGTAATCCCCCCAAACC 
AAAGGGAAAGCCTCCTGCTCCCAAACCAGCCAGTCCCAAGAAGAACATTAAAACTAGGAGTGC 
CCAGAAGAGAACAAACCCGAAAAGAGTGTGAGCTAACTAGTTTCCAAAGCGGAGACTTCCGAC 
TTCCTTACAGGATGAGGCTGGGCATTGCCTGGGACAGCCTATGTAAGGCCATGTGCCCCTTGCC 
CTAACAACTCACTGCAGTGCTCTTCATAGACACATCTrGCAGCATTTTTCTTAAGGCrATC 
AGTTTTTCTTTGTAAGCCATCACAAGCCATAGTGGTAGGTTTGCCCTTTGGTACAGAAGGTGAG 
TTAAAGCTGGTGGAAAAGGCTTATTGCATTGCATTCAGAGTAACCTGTGTGCATACTCTAGAAG 
AGTAGGGAAAATAATGCTTGTTACAATTCGACCTAATATGTGCATTGTAAAATAAATGCCATAT 
TTCAAACAAAACACGTAATTTTTTTACAGTATGTTTTATTACCTT^ 

GTTAGTGATGTITrAAAATGTGATGAAAATATAATGTTTTTAAGAAGGAACAGTAGTGGAAtGA 
ATGTTAAAAGATCTTTATGTGTTTATGGTCTGCAGA^ 
GAAAAATTAGAGAAGTAGCATATGGAAAATTATAATGTGTT^

TGTTTTTAGCTAGAAACTTAAAAACAAAAATAATAATAAAGAAAAATAAATAAAAAGGAGAGG 
CAGACAATGTCTGGATTCCTGTTTTTTGGTTAC^^ 

CACCCTCTTAAGCAGCACCAGAAACAGTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAATT 
AGTTGGCTAATGCTCAAGTATTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTCCCAG 
GACATCCACCCrGAGAATAATITGACAAGCITAAAAATGGCCTTCATGTGAGTGCCAAATTTTG 
TTTTTCITCATTrAAATATTTC 

GAGAGGAAAGTTGAGTTCCACCTCTGAAATGAGAATTACTTGACAGTTGGGATACTTTAATCAG 
AAAAAAAGAACTTATTTGCAGCATTTTATCAACAAATTTCATAATTGTGGACAATTGGAGGCAT 
TTATTTTAAAAAACAATTTTATTGGCCTTTTGCTAACAC 

CAATAAATGCACAACGCCCAAAGGAAATAAAATCCTATCTAATCCTACTCTCCACTACACAGA 

GGTAATCACTATTAGTATTTTGGCATATTATTCTCCAGGTGTTTGCITATGCACITATAAAATGA 

TTTGAACAAATAAAACTAGGAACCTGTATACATGTGTTTCATAACCrGCCTCCITTGCTTGGCCC 

TTTATrGAGATAAGTTTTCCTGTCAAGAAAGCAGAAACCATCTCATTTCTAACAGCTGTGTTATA 

TTCCATAGTATGCATTACTCAACAAACTGTTGTGCrATTGGATACTTAGGTGGTTTCTTCACTGA 

CAATACTGAATAAACATCTCACCGGAATTC 

Figure 45 

aagcttgatatcgaattcgcggccgcgtcgacgggaggcgccaggatcagtcggggcacccgc 

agcgcaggctgccacccacctgggcgacctccgcggcggcggcggcggcggctgggtagagtc 

agggccgggggcgcacgccggaacacctgggccgccgggcaccgagcgtcggggggctgcgc 

ggcgcgaccctggagagggcgcagccgatgcgggcggcggcggcggcggggggcgtgcggac 

ggccgcgctggcgctgrctgctgggggcgctgcactgggcgccggcgcggtgcgaggagtacga 

ctactatggctggcaggccgagccgctgcacggccgctcctactccaagccgccgcagtgcctt 

gacatccctgccgacctgccgctctgccacacggtgggctacaagcggatgcggctgcccaacc

tgctggagcacgagagcctggccgaagtgaagcagcaggcgagcagctggctgccgctgctgg 

ccaagcgctgccactcggatacgcaggtcttcctgtgctcgctctttgcgcccgtctgtctcgac 

cggccCatctacccgtgccgctcgctgtgcgaggccgtgcgcgccggctgcgcgccgctcatgg 

aggccracggcttcccctggcctgagatgctgcacrgccacaagttccccctggacaacgacct 

ctgcatcgccgtgcagttcgggcacctgcccgccaccgcgcctccagtgaccaagatctgcgcc 

cagtgtgagatggagcacagtgctgacggcctcatggagcagatgtgctccagtgactttgtg 

gtcaaaatgcgcatcaaggagatcaagatagAgaatggggaccggaagctgattggagccca 

gaaaaagaagaagctgctcaagccgggccccctgaagcgcaaggacaccaagcggctggtgc 

tgCacatgaagaatggcgcgggctgcccctgcccacagctggacagcctggcgggcagcttcc 

tggtcatgggccgcaaagtggatggacagctgctgctcatggccgtctaccgctgGgacaaga 

agaataaggagatgaagtttgcagtcaaattcatgttcrcctacccctgctccctctactaccct 

ttcttctacggggcggcagagccccactgaagggcact 

gcttgccctctg<jccccg<:cccaacitccaggctgacccggcccractggagg gtgttt ^ 

aatgttgttactggcacaaggcctaagggatgggcacggagcccagg 

ggggtcctggggtccctgggatgttgggcttcctctctcaggagcagggctt(^catctgggt 

gaagacctcagggtctcagaaagtaggcaggggaggagagggtaagggaaaggtggaggggc 

tcagggcaccctgaggcggaggtttcagagtagaaggtgatgtcagctccagctcccctctgtc 

ggtggtggggcctcaccitgaagagggaagtctcaatattaggctaagctatttgggaaagttc 

tccccaccgcccctgtacgcgtcatcctagccccccttaggaaaggagttagggtctcagtgcc 

tccagccacaccccctgccttccccagcttgcccatttccctgccccaag^cccagagctccccc 

cagactggagagcaagcccagcccagcctcggcatagacccccttctggtccgcccgtggctcg 

attcccgggattcattccrcagcctctgcttct 

TGAGGCCATAGGTACTAGACAACCAATACATGCAGGGTTGGGTTTTCTAAIllllTTAAClllll 
AATTAAATCAAAGGTCGACGCGCGGCCGCGGAATTCCTGCAGCCCGGGGGATCCCCGGGTACC 

GAGCTCGAATTC 
ATGCATCTCCTCTTATTTCAGCTGCTGGTACTCCTGCCTCTAGGAAAGACCACACGGCACCAGG

ATGGCCGCCAGAATCAGAGTTCTCTTTCCCCCGTACTCCTGCCAAGGAATCAAAGAGAGCTTCC 

CACAGGCAACCATGAGGAAGCTGAGGAGAAGCCAGATCTGTTTGTCGCAGTGCCACACCTTGT 

AGCCACCAGCCCTGCAGGGGAAGGCCAGAGGCAGAGAGAGAAGATGCTGTCCAGATTTGGCA 

GGTTCTGGAAGAAGCCTGAGAGAGAAATGCATCCATCCAGGGACTCAGATAGTGAGCCCTTCC 

CACCTGGGACCCAGTCCCTCATCCAGCCGATAGATGGAATGAAAATGGAGAAATCTCCTCTTCG 

GGAAGAAGCCAAGAAATTCTGGCACCACTTCATGTTCAGAAAAACTCCGGCTTCTCAGGGGGT 

CATCTTGCCCATCAAAAGCCATGAAGTACATTGGGAGACCTGCAGGACAGTGCCCTTCAGCCA 

GACTATAACCCACGAAGGCTGTGAAAAAGTAGTTGTTCAGAACAACCTTTGCTTTGGGAAATGC 

GGGTCrGTTCATTTTCCTGGAGCCGCGCAGCACTCCCATACCTCCTGCrCTCACTGTTTGCCTGC 

CAAGTTCACCACGATGCACTTGCCACrGAACTGCACTGAACTTTCCTCCGTGATCAAGGTGGTG 

ATGCTGGTGGAGGAGTGCCAGTGCAAGGTGAAGACGGAGCATGAAGATGGACACATCCTACAT 

GCTGGCTCCCAGGATTCCTTTATCCCAGGAGTTTCAGCTTGA 



Figure 47 

CGGCACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATITrCTCTTTCTTTCT 

TGAGTCCTrCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGT 

AGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAAT 

TCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTGCA 

GTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACAACTAC 

CAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACC 

CGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATG 

CGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAA 

ATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACCTT 

GGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAGG 

TTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGGTAGACACTTCTGGTCCA 

AGATCTGTAAACCTGTCGTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCTCTC 

ATGGACrAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGAAAGA 

TCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACACTAAACCAGCTATCCA 

AATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTTTTATGACCTTCATCAACTCAA 

TCCTAAGGATATACAAGTTCTGTGGT1TCAGTTAAGGATTCCAATAACACGTTCCAAAAACCTG 

GAGTGTAAGAGCTTTGTITCTTTATGGAACTCCCCTGTGATTGCAGTAAATTACTGTATTGTAAA

TTCTCAGTGTGGCACTTACCTGTAAATGCAATGAAACTTTTAATTAlllllCTAAAGGTGCTGCA 

CTGCCTATTTTTCCTCTTGTTATGTAAATTTTTGTACACATTGATTGT^ 

TTCTATATTGAACTGAAGTAAATCATTTCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCA 
TITAATTCTAGAGTCTAGAACGCAAGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGT 
AACATGAAAATACTAGCTTATTTTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCC^ 
AGGCTGTGATAGTTTTTGAAATAAAATTTAACATTTAATATCATGAAATGTTATAAGTAGACAT 



Figure 48 

GCGGGTCTCGCTTGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGGGTCCT 

GGGTCCCCGAACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACCCAAGTGAGG 

GGCCCCGTGTTGGGGTCCTCCCTCCCTTTGCATTCCCACCCCrCCGGGCTTTGCGTCTTCCTGGG 

GACCCCCTCGCCGGGAGATGGCCGCGTTGATGCGGAGCAAGGATTCGTCCTGCTGCCTGCTCCT 

ACTGGCCGCGGTGCTGATGGTGGAGAGCTCACAGATCGGCAGTTCGCGGGCCAAACTCAACTC 

CATCAAGTCCTCTCTGGGCGGGGAGACGCCTGGTCAGGCCGCCAATCGATCTGCGGGCATGTAC 

CAAGGACTGGCATTCGGCGGCAGTAAGAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGC 

AGTGATAAGGAGTGTGAAGTTGGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGC 

ATGGTGTGTCGGAGAAAAAAGAAGCGCTGCCACCGAGATGGCATGTGCTGCCCCAGTACCCGC 

TGCAATAATGGCATCTGTATCCCAGTTACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGG 

ATGGTACTCGGCACAGAGATCGAAACCACGGTCATTACTCAAACCATGACTTGGGATGGCAGA 

ATCTAGGAAGACCACACACTAAGATGTCACATATAAAAGGGCATGAAGGAGACCCCTGCCTAC 

GATCATCAGACTGCATTGAAGGGTTTTGCTGTGCTCGTCATrTCTGGACCAAAATCTGCAAACC 
AGTGCTCCATCAGGGGGAAGTCTGTACCAAACAACGCAAGAAGGGTTCTCATGGGCTGGAAAT

TITCCAGCGTTGCGACTGTGCGAAGGGCCrGTCTTGCAAAGTATGGAAAGATGCCACCTACTCC 

TCCAAAGCCAGACTCCATGTGTGTCAGAAAATTTGATCACCATTGAGGAACATCATCAATTGCA 

GACTGTGAAGTTGTGTATTTAATGCATTATAGCATGGTGGAAAATAAGGTTCAGATGCAGAAG 

AATGGCTAAAATAAGAAACGTGATAAGAATATAGATGATCAC 



Figure 49 

CTATCACAATGAGACCAACACAGACACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGA 

AATTCACAAGATAACCAACAACCAGACTGGACAAATGGTCTTTTCAGAGACAGTTATCACATCT 

GTGGGAGACGAAGAAGGCAGAAGGAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAG 

CATGTACTGCCAGTTTGCCAGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTC 

TGCACCCGGGACAGTGAGTGCTGTGGAGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATG 

GCCACCAGGGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGC 

TGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCTTT 

GCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATGGAGCCTT 

GGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGCCCCACAGCCACAGCCTGGTGTATGTG 

TGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGGGAGATCCTGCTGCCCAGAGAGGTC 

CCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCGCCAGGAGCTGGAGGACCTGGAG 

AGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTGCCGCCGCTGCACTGCTGGGA 

GGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAATAGAAATAGCTAATTTAT 

TTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTT^ 

TTTCCCCTCTGGCITGACAGCATGAGGTGTTGTGCATTTGTTCAGCTCCCCCAGGCTGTTCTCCA 

GGCTrCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACC 

CCTGTCCAGATTATTGGCTGCTITGCCTCTACCAGTTGGCAGACAGCCGTTTGTTCTACATGGCT 

TTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTCTGATTGGTTTTGGGG 

AAATGTGGAGAAGAGTGCCCTGCITrGCAAACATCAACCTGGCAAAAATGCAACAAATGAATT 

TTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGCAGATGAAATGTTC 

TGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCCTACCTCTGTGCCA 

GGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGGAGGGGGTCATTG 

TTCTGCTCGTCCATCAGGGATCrCAGAGGNCTCAGAGACTGCAAGCTGGTTGCCCAAGTCACAC 

AGCTAGTGAAGACeAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCTCTCCACTAC 

CCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGATTTTTCTTT 

TGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCGCTCTAAAAGTAAACTACT 

GTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGATATTG 

ACACTGTCCCTCITrGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCATAC 

AGGTTAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGCA 

AAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGG 

CTGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGA 

TGTTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTG 

CACATGATTGTATAAGCATGCirTCTTTGAGTTTTAAATTATGTATAAACATAAGT^ 

AAATCAAGCATAAATCAC 



Figure 50 

AGACGACGTGCTGAGCTGCCAGCTTAGTGGAAGCTCTGCTCTGGGTGGAGAGCAGCCTCGCiTT 

GGTGACGCACAGTGCTGGGACCCTCCAGGAGCCCCGGGATTGAAGGATGGTGGCGGCCGTCCT 

GCTGGGGCTGAGCTGGCTCTGCTCTCCCCTGGGAGCTCTGGTCCTGGACTTCAACAACATCAGG 

AGCTCTGCTGACCTGCATGGGGCCCGGAAGGGCTCACAGTGCCTGTCTGACACGGACTGCAAT 

ACCAGAAAGTTCTGCCTCCAGCCCCGCGATGAGAAGCCGTTCTGTGCTACATGTCGTGGGTTGC 

GGAGGAGGTGCCAGCGAGATGCCATGTGCTGCCCTGGGACACTCTGTGTGAACGATGTTTGTAC 

TACGATGGAAGATGCAACCCCAATATTAGAAAGGCAGCTTGATGAGCAAGATGGCACACATGC 

AGAAGGAACAACTGGGCACCCAGTCCAGGAAAACCAACCCAAAAGGAAGCCAAGTATTAAGA 

AATCACAAGGCAGGAAGGGACAAGAGGGAGAAAGTTGTCTGAGAACTTTTGACTGTGGCCCTG 
GACTTTGCTGTGCTCGTCATTTTTGGAC

CTGCrCCAGAAGAGGGCATAAAGACACrGGTCAAGCTGCAGAAATCTTCCAGCGTTGCGACTGT 
GGCCCTGGACTACTGTGTCGAAGCCAATTGACCAGCAATCGGCAGCATGCTCGATTAAGAGTAT 
GCCAAAAAATAGAAAAGCTATAAATATTTCAAAATAAAGAAGAATCCACATTGC 



Figure 51 

AGGCAGAATACTTCTATGAATTCCTGTCCTTGCGCTCCCTGGATAAAGGCATCATGGCAGATCC 

AACCGTCAATGTCCCTCTGCTGGGAACAGTGCCrCACAAGGCATCAGTTGTTCAAGTTGGTTTC 

CGATGTCTTGGAAAACAGGATGGGGTGGCAGCATTTGAAGTGGATGTGATTGTTATGAATTCTG 

AAGGCAACACCATTCTCCAAACACCTCAAAATGCTATCTTCITTAAAACATGTCAACAAGCTGA 

GTGCCCAGGCGGGTGCCGAAATGGAGGCTTTTGTAATGAAAGACGCATCTGCGAGTGTCCTGA 

TGGGTTCCACGGACCrCACTGTGAGAAAGCCCTTTGTACCCCACGATGTATGAATGGTGGACTT 

TGTGTGACrCCrGGTTTCTGCATCTGCCCACCTGGATTCTATGGAGTGAACTGTGACAAAGCAA 

ACTGCTCAACCACCTGGriTAATGGAGGGACCTGTTTCTACCCTGGAAAATGTATTTGCCCrC 

GGACTAGAGGGAGAGCAGTGTGAAATCAGCAAATGCCCACAACCCTGTCGAAATGGAGGTAA 

ATGCATTGGTAAAAGCAAATGTAAGTGTTCCAAAGGTTACCAGGGAGACCTCTGTTCAAAGCCT 

gtcrgcgagcctggctgtggtgcacatggaacctgccatgaacccaacaaatgccaatgtcaa 

gaaggttggcatggaagacactgcaataaaaggtacgaagccagcctcatacatgccctgagc 

gcagcagcgcccagctcaggcagcacacgccttcacttaaaaaggccgaggagcggcggcatc 

caccrgaatccaattacatctggtgaactccgacatcrgaaacgttttaagttacacgaagttc 

atagcgittgttaaccittcatgtgtrgaatgttcaaataatgttcattacacttaaga^ 

gcctgaattttattagcitcatrataaatcact 

aagtacgtctgtagcatgatggtatagattttcttc^ 

atgtcaattgatcaggttaaaattttcagtgtgtagttggcagatattttcaaaattacaatgc 
atttatggtgtctgggggcaggggaacatcagaaaggttaaattgggcaaaaatgcgtaagtc 
acaagaatttggatggtgcagttaatgttgaagttacagcatttcagattttattgtcagatat 
ttagatgtttgttacatttttaaaaattgct 

ccttaccattattccagagattcagtattaaaaaaaaaaaaattacactgtggtagtggcattt 

aaacaatatmtatattctaaacacaatgaaatagggAatataatgtatgaacttt^ 

gcttgaagcaatataatatattgtaaacaaaacacagct 

TTTGTATGTATAAAATAAAGGTGCTGCTTTAGTTTTC 



Figure 52

ArGGGCATCGGGCGCAGCGAGGGGGGCCGCCGCGGGGCAGCCCTGGGCGTGCTGCTGGCGCTGGGCGCGG 

CGCTTCTGGCCGTGGGCTCGGCCAGCGAGTACGACTACGTGAGCTTCCAGTCGGACATCGGCCCGTACCA 

GAGCGGGCGCTTCTACACCAAGCCACCTCAGTGCGTGGACATCCGCGCGGACCTGCGGCTGTGCCACAAC 

GTGGGCTACAAGAAGATGGTGCTGCCCAACCTGCTGGAGCAGGAGACCATGGCGGAGGTGAAGCAGCAGG 

CCAGCAGCTGGGTGCCCCTGCTCAACAAGAACTGCCACGCCGGCACCCAGGTCTTCCTCTGCTCGCTCTT 

CGCGCCCGTCTGCCTGGACCGGCCCATCTACCCGTGTCGCTGGCTCTGCGAGGCCGTGCGCGACTCGTGC 

GAGCCGGTCATGCAGTTCTTCGGCTTCTACTGGCCCGAGATGCTTAAGTGTGACAAGTTCCCCGAGGGGG 

ACGTCTGCATCGCCATGACGCCGCCCAATGCCACCGAAGCCTCCAAGCCCCAAGGCACAACGGTGTGTCC 

TCCCTGTGACAACGAGTTGAAATCTGAGGCCATCATTGAACATCTCTGTGCCAGCGAGTTTGGGCTGAGT 

TTAAAGATGATTGTGGGTAGCTCCCATAACTCATGCTGCACGCTGGGTCCTTCTCATCCCAACTCCTCAA 

AGCGGCAGGAGCAGGAACTGGGGACTCCTGAGAGAAGGCTTGGATATGGCCTTTTATTACACTTCATCCA 

AGGAAATCTGCCCCCACCCTGTGCCCAGGCCCGATCACGCATGAGGCTAAAGACGGAGGCCACTCCGCTG 

GCTCTGGGTAGATCTGCCCCTGGACTGTTTGCCGACTGCCCGGAGCGCCCTCTGCCGGTCTGCAGCTTCC 

CACACCACACGGAAGAAGTGGGGAAACTGAGGATACATTCTTTCCTCCTCCAGGTAAAGGGATTCTCAAT 

GAAGGGCTTGTGTGCACCTTCCACACTTAGATACCTCTACTACCTGAAAACCAGCATGCAGCATGTACAT 

CAAGAGTACCAGGCACATAGTGCTCAAGTCTGGGCTAATATGCCACCTGCAGAGAGATGTAAAGATGAAG 

AAGACAAAGCCATGTTTTCAAAGTGA 
GGCGGGTTCGCGCCCCGAAGGCTGAGAGCTGGCGCTGCTCGTGCCCTGTGTGCCAGACGGCGGAGCTCGG
CGGCCGGACCCCGCGGCCCCGCTTTGCTGCCGACTGGAGTTTGGGGGAAGAAACTCTCCTGCGCCCCAGA 
AGATTTCTTCCTCGGCGAAGGGACAGCGAAAGATGAGGGTGGCAGGAAGAGAAGGCGCTTTCTGTCTGCC 
GGGGTCGCAGCGCGAGAGGGCAGTGCCATGTTCCTCTCCATCCTAGTGGCGCTGTGCCTGTGGCTGCACC 
TGGCGCTGGGCGTGCGCGGCGCGCCCTGCGAGGCGGTGCGCATCCCTATGTGCCGGCACATGCCCTGGAA 
CATCACGCGGATGCCCAACCACCTGCACCACAGCACGCAGGAGAA 

GAGGAGCTGGTGGACGTGAACTGCAGCGCCGTGCTGCGCTTCTTCTTCTGTGCCATGTACGCGCCCATTT 
GCACCCTGGAGTTCCTGCACGACCCTATCAAGCCGTGCAAGTC 

CGAGCCCCTCATGAAGATGTACAACCACAGCTGGCCCGAAAGCCTGGCCTGCGACGAGCTGCCTGTCTAT 
GACCGTGGCGTGTGCATTTCGCCTGAAGCCATCGTCACGGACCTCCCGGAGGATGTTAAGTGGATAGACA
TCACACCAGACATGATGGTACAGGAAAGGCCTCTTGATGTTGACTGTAAACGCCTAAGCCCCGATCGGTG 
CAAGTGTAAAAAGGTGAAGCCAACTTTGGCAACGTATCTCAGCAAAAAC

AAAATAAAAGCTGTGCAGAGGAGTGGCTGCAATGAGGTCAGAACGGTGGTGGATGTAAAAGAGATCTTC^ 
AGTCCTCATCACCCATCCCTCGAACTCAAGTCCCGCTCATTACAAATTCTTCTTGCCAGTGTCCACACAT 
CCTGCCCCATCAAGATGTTCTCATCATGTGTTACGAGTGGCGTTCAAGGATGATGCTTCTTGAAAATTGC 
TTAGTTGAAAAATGGAGAGATCAGCTTAGTAAAAGATCCAT^ 

GGAGAACAGTTCAGGACAAGAAGAAAACAGCCGGGCGCACCAGTCGTAGTAATCCCCCCAAACCAAAGGG
AAAGCCTCCTGCTCCCAAACCAGCCAGTCCCAAGAAGAACATTAAAACTAGGAGTGCCCAGAAGAGAACA
AACCCGAAAAGAGTGTGAGCTAACTAGTTTCCAAAGCGGAGACTTCCGACTTCCTTACAGGATGAGGCTG
GGCATTGCCTGGGACAGCCTATGTAAGGCCATGTGCCCCTTGCCCTAACAACTCACTGCAGTGCTCTTCA 
TAGACACATCTTGCAGCATTTTTCTTAAGGCTATGCTTG 

GTGGTAGGTTTGCCCTTTGGTACAGAAGGTGAGTTAAAGCTGGTGGAAAAGGCTTATTGCATTGCATTCA 
GAGTAACCTGTGTGCATACTCTAGAAGAGTAGGGAAAATAATGCTTGTTACAATTCGACCTAATATGTGC
ATTGTAAAATAAATGCCATATTTCAAACAAAACACGTAATTTTTTTACAGTATGTTTTATTACCTTTTGA
TATCTGTTGTTGCAATGTTAGTGATGTTTTAAAATGTGATGAAAATATAATGTTTTTAAGAAGGAACAGT 
AGTGGAATGAATGTTAAAAGATCTTTATGTGTTTATGGTCTGCAGAAGGATTTTTGTGATGAAAGGGGAT 
TTTTTGAAAAATTAGAGAAGTAGCATATGGAAAATTATAATGTGTTTTTTTACCAATGACTTCAGTTTCT 
GTTTTTAGCTAGAAACTTAAAAACAAAAATAATAATAAA

GTCTGGATTCCTGTTTTTTGGTTACCTGATTTCCATGATCATGATGCTTCTTGTCAACACCCTCTTAAGC 
AGCACCAGAAACAGTGAGTTTGTCTGTACCATTAGGAGTTAGGTACTAAT 

ATTTTATACCCACAAGAGAGGTATGTCACTCATCTTACTTCCCAGGACATCCACCCTGAGAATAATTTGA

CAAGCTTAAAAATGGCCTTCATGTGAGTGCCAAATTTTGTTTTTCTTCATTTT^AATATTTTCTTTGCCTA 

AATACATGTGAGAGGAGTTAAATATAAATGTACAGAGAGGAAAGTTGAGTTCCACCTCTGAAATGAGAAT 

TACTTGACAGTTGGGATACTTTAATCAGAAAAAAAGAACTTAT^ 

AkTTGTGGACAATTGGAGGCATTTATTTTAAAAAA^^ 

GTATTTTATAAGGGATTCAATAAATGCACAACGCCCAAAGGAAATAAAATCCTA 

ACTACACAGAGGTAATCACTATTAGTATTTTGGCATATTATTCTCCAGGTGTTTGCTTATGCAC 

AATGATTTGAACAAATAAAACTAGGAACCTGTATACATGTGTTTCATAACCTGCCTCCTTTGCTTGGCCC 

TTTATTGAGATAAGTTTTCCTGTCAAGAAAGC^GAAACCATCTCATTTCTAACAGCTGTGTTATATTC^ 

TAGTATGCATTACTCAACAAACTGTTGTGCTATTGGATACTTAGGTGGTTTCTTCACTGACAATACTGAA 

TAAACATCTCACCGGAATTC 

Figure 54 

GAGGCGCCTTGGGACCGCGTGGGAGCCGCAGCCGAACCGAGTAGGGACCGGGACCGCGCGGCGCCGCCG 

TCCCCGGCCGGGCCCGGCCCCCGCGAGCCGAGCGCGCGCCCCCGTCGCCCACCCGGGCGCGGCTGGATGC 

GGCGGGGTCCCCGCGGCGGCGACCCCCGGCCCCGAGCGCCCGGAGCGCCCAGAGGCGGCGTGCGGGGCC 

CGGGGACGCCGCGCCCTSTBGTGCGCCGAGGCGCGCCCCGAGACAGCCGGGGGCCCGCGCCGCAGCCGC 

CGCCCGCGCTGAGCCCCGGCCCGGCCCGCGGCCCGCGCCCGGCGGCAGC3SITGAGCCAGGCCGAGCTGTC 

CACCTGCTCCGCGCCGCAGACGCAGCGCATCTTCCAGGAGGCTGTGCGCNAGGGCAACACGCAGGAGCT 

GCAGTYGCTGCTGCAGAACATGACCAACTGCGAGTTCAACGTGAACTCGTTCGGGCCCGAGGGCCAGAC 

GGCGCTGCACCAGTCGGTCATCGTCGGCAACCTGGTGCTCGTGAAGCTGCTGGTCAAGTTCGGCGCCGAC 

ATCCGCCTGGCCAACCGCGACGGCTGGAGCGCGCTGCAMATCGCCGCGTTCGGTGGCCACCAGGACATC 

GTGCTCTATCTCATCACCAAGGCGAAGTACGCGGCCAGCGCSGGTGTATGCCCGCCGGGACCCCGGACCC 

CGGCCCTGCGCCCGCGTCGTCTCTGCTGTACCnTCCCGCCAACTACCTCGGTGCGCGCMCGGCTCGCAGG 

CCCCGCCAGAAGGCCCGTGGCAACGGCGAATACGGCGCGTGCGTCMCGGCCCCAGGGTC 